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The present invention relates to novel anti-angiogenic proteins, related t o thr ombospondin. More specifically, isolated nucleic acid 
molecules are provided encoding human METH1 and METH2. METH1 and METH2 polypeptides are also provided, as are vectors, host 
cells and recombinant methods fox producing the same. Also provided are diagnostic methods for the prognosis of cancer and therapeutic 
methods for treating individuals in need of an increased amount of METH1 or METH2. 
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Background of the Invention 

Federally-Sponsored Research and Development 

Part of the work performed during development of this invention utilized 
U.S. Government funds. The U. S. Government has certain rights in this invention. 

Field of the Invention 

The present invention relates to novel anti-angiogenic proteins, related to 
thrombospondin. More specifically, isolated nucleic acid molecules are provided 
encoding human METH1 and METH2 (ME, for metalloprotease, and TH, for 
thrombospondin). METH1 and METH2 polypeptides are also_ provided, as are 
vectors, host cells and recombinant methods for producing the same. Also 
provided are diagnostic methods for the prognosis of cancer and therapeutic 
methods for treating individuals in need of an increased amount of METH1 or 
METH2. 

Related Art 

Angiogenesis, the formation of new blood vessels from pre-existing 
vasculature, is a tightly regulated process in normal adults. Under physiological 
circumstances, growth of new capillaries is tightly controlled by an interplay of 
growth regulatory proteins which act either to stimulate or to inhibit blood vessel 
growth. Normally, the balance between these forces is tipped in favor of inhibition 
and consequently blood vessel growth is restrained. Under certain pathological 
circumstances, however, local inhibitory controls are unable to restrain the 
increased activity of angiogenic inducers. Angiogenesis is a key step in the 
metastasis of cancer (Folkman, Nature Med. l\21-2\ (1995)) and in abnormal 
wound healing, inflammation, rheumatoid arthritis, psoriasis, and diabetic 
retinopathy, it is integral to the pathology (Folkman et al„ Science 2J5:442-447 
(1 987)), engendering the hope that these pathological entities could be regulated 
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by pharmacological and/or genetic suppression of blood vessel growth (Iruela- 
Arispee/a/., Thromb. Haem. 78.612-611 1997)). 

Thrombospondin-1 (TSP-1) is a 450 kDa, anti-angiogenic adhesive 
glycoprotein released from activated platelets and secreted by growing cells 
(reviewed in Adams, Int. J. Biochem. Cell Biol 29:861-865 (1997)). TSP-1 is a 
homotrimer, with each subunit comprised of a 1152 amino acid residue 
polypeptide, post-translationally modified by AMinked glycosylation and beta- 
hydroxylation of asparagine residues. 

TSP-1 protein and mRNA levels are regulated by a variety of factors. 
TSP-1 protein levels are* downregulated by IL-1 alpha and TNF alpha. TSP-1 
mRNA and protein levels are upregulated by polypeptide growth factors including 
PDGFrTGF-beta, and bFGF (Bornstein, Faseb-J. 6:-3290-3299 (1992)) and are — — 
also regulated by the level of expression of the p53 tumor suppressor gene product 
(Damerone/a/., Science 265:1582-1584 (1994)). At least four other members 
of the thrombospondin family have been identified: TSP-2, TSP-3, TSP-4, and 
TSP-5 (also called COMP). There is a need in the art to identify other molecules 
involved in the regulation of angiogenesis. 

Summary of the Invention 

The present invention provides isolated nucleic acid molecules comprising 
a polynucleotide encoding the METH1 polypeptide having the amino acid 
sequence shown in SEQ ID NO:2 or the amino acid sequence encoded by the 
cDNA clone deposited in a bacterial host as ATCC Deposit Number 209581 on 
January 15, 1998. 

The present invention also provides isolated nucleic acid molecules 
comprising a polynucleotide encoding the METH2 polypeptide having the amino 
acid sequence shown in SEQ ID NO:4 or the amino acid sequence encoded by the 
cDNA clone deposited in a bacterial host as ATCC Deposit Number 209582 on 
January 15, 1998. 
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The present invention also relates to recombinant vectors, which include 
the isolated nucleic acid molecules of the present invention, and to host cells 
containing the recombinant vectors, as well as to methods of making such vectors 
and host cells and for using them for production of METH1 or METH2 
5 polypeptides or peptides by recombinant techniques. 

The invention further provides an isolated METH1 or METH2 polypeptide 
having an amino acid sequence encoded by a polynucleotide described herein. 

The invention further provides a diagnostic method useful during diagnosis 
or prognosis of cancer. 

[0 An additional aspect of the invention is related to a method for treating an 

individual in need of an increased level of METH1 or METH2 activity in the body 
comprising administering to such an individual a composition comprising a 
therapeutically effective amount of an isolated METH1 or METH2 polypeptide 
of the invention or an agonist thereof. 

15 Brief Description of the Figures 

Figure 1 shows the nucleotide (SEQ ED NO:l) and deduced amino acid 
(SEQ ID NO:2) sequences of METHL The protein has a predicted leader 
sequence of about 28 amino acid residues (underlined). 

20 Figure 2 shows the nucleotide (SEQ ID NO:3) and deduced amino acid 

(SEQ ID NO:4) sequences of METH2. The protein has a predicted leader 
sequence of about 23 amino acid residues (underlined). 

Figure 3 shows a comparison of the amino acid sequence of METH 1 (SEQ 
ID NO:2) and METH2 (SEQ ID NO:4) with that of their closest homologue, a 

25 bovine metalloprotease (pNPl) (SEQ ID NO:5). Identical amino acids are boxed. 

Functional domains predicted by sequence and structural homology are labeled, 
including the signal peptide (single line), the potential cleavage site for mammalian 
subtilisin (double underlined), the zinc-binding-site (dotted line) in the 
metalloprotease domain, and the putative disintegrin loops (arrows). 
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Figure 4 shows the primary structure of METH1, METH2 and pNPI 
which includes a prodomain, a catalytic metalloprotease domain, a cysteine rich 
disintegrin domain, a TSP-like domain, a spacer region and a different number of 
TSP-like domains, three for METH1, two for METH2, and four for pNPI. 
5 Figure 5 shows a comparison of the TSP-like domain of METH1 (SEQ ID 

NO:2) and METH2 (SEQ ID NO:4) with those of TSP1 (SEQ ID NOs:6, 7, and 
8) and TSP2 (SEQ ID NOs:9, 10, and 11), cysteines are numbered 1 to 6, 
tryptophans are marked by asterisks. 

Figure 6 shows that peptides and recombinant protein derived from the 

10 TSP-like domain of METH1 and METH2 block VEGF-induced angiogenesis. 

Angiogenesis was induced on CAMs from 12-14-day-old embryos using a nylon 
mesh containing VEGF casted on matrigel and in the presence or absence of the 
peptides or recombinant protein. Capillary density was evaluated as described in 
Example 4. Positive and negative control included VEGF alone and vehicle alone, 

15 respectively. (A) Quantification of the angiogenic response induced by VEGF in 

the presence of recombinant proteins. TSP1, purified platelet TSP1, GST, 
purified GST, GST-TSP1, GST-METH1, and GST-METH2 are described in 
Example 4. (B) Quantification of the angiogenic response induced by VEGF in 
the presence or absence of the peptides; P-TSP1, P-METH1, and P-METH2 

20 (peptide derived from the Type I repeats of TSP, METH1 and METH2, 

respectively); SCI and SC2 are scramble peptides used as controls. (C) Dose- 
response of the VEGF-induced angiogenesis in the presence of GST-METH1. 
(D) Dose-response of the VEGF-induced angiogenesis in the presence of GST- 
METH2. The angiogenic index was expressed considering the vascular response 

25 from the VEGF-matrigel as 1 00% and subtracting the background levels (matrigel 

alone). Assays were repeated, at least, twice. Each treatment was done in 
triplicate. Values represent the mean, bars indicate standard deviations. * p<0. 00 1 . 

Figure 7 shows the effect of METH1 and METH2 recombinant proteins 
on bFGF-stimulated cell proliferation. Cells were cultured on 24-well plates in 

30 media containing bFGF and the recombinant protein to be tested (3 ^ig/ml, unless 
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indicated in the graph). Controls included vehicle or GST recombinant protein 
alone. (A), HDEC, human dermal endothelial cells; (B), HMEC, human mammary 
epithelial cells; (C), HDF, human dermal fibroblasts; (D), SMC, smooth muscle 
cells; (E) Dose-response of GST-METH1 and GST-METH2 on HDEC 
proliferation. Experiments were repeated, at least, twice. Each treatment was 
done in triplicate. Values represent the mean, bars indicate standard deviations. 
*p<0.01. 

Figure 8 shows a schematic representation of the pHE4-5 expression 
vector (SEQ ID NO: 12) and the subcloned METH1 or METH2 cDNA coding 
sequence. The locations of the kanamycin resistance marker gene, the METH1 
or METH2 coding sequence, the oriC sequence, and the laclq coding sequence 
are indicated— — - — - • 

Figure 9 shows the nucleotide sequence of the regulatory elements of the 
pHE promoter (SEQ H> NO: 13). The two lac operator sequences, the 
Shine-Delgarno sequence (S/D), and the terminal Hindlll and Ndel restriction 
sites (italicized) are indicated. 

Figure 10 shows an analysis of the METH1 amino acid sequence. Alpha, 
beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 
regions; flexible regions; antigenic index and surface probability are shown, and 
all were generated using the default settings. In the "Antigenic Index or 
Jameson-Wolf' graph, the positive peaks indicate locations of the highly antigenic 
regions of the METH1 or METH2 protein, i.e., regions from which epitope- 
bearing peptides of the invention can be obtained. The domains defined by these 
graphs are contemplated by the present invention. Tabular representation of the 
data summarized graphically in Figure 1 0 can be found in Table 1 . 

Figure 1 1 shows an analysis of the METH2 amino acid sequence. Alpha, 
beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 
regions; flexible regions; antigenic index and surface probability are shown, and 
all were generated using the default settings. In the "Antigenic Index or 
Jameson-Wolf graph, the positive peaks indicate locations of the highly antigenic 
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regions of the METH1 or METH2 protein, i.e., regions from which epi tope- 
bearing peptides of the invention can be obtained. The domains defined by these 
graphs are contemplated by the present invention. Tabular representation of the 
data summarized graphically in Figure 1 1 can be found in Table 2. 
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Detailed Description 

By screening cDNA libraries with cDNA encoding the anti-angiogenic 
domain of TSP-l, the present inventors have identified two novel proteins, 
METH1 and METH2 (also called VEGA-1 and VEGA-2, respectively, for 
vascular endothelial growth antagonist) which contain the anti-angiogenic domain 
of TSP- 1 , a metalloproteinase domain, and a disintegrin-Iike domain. The present 
inventors have demonstrated that both METH1 and METH2 have anti-angiogenic 
activity. 

Thus, the present invention provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a METH1 polypeptide having the amino 
acid sequence shown in SEQ ED NO;2, which was~determined "by sequencing a 
cloned cDNA. The METH1 protein of the present invention shares sequence 
homology with thrombospondin- 1 and pNPI. The nucleotide sequence shown in 
SEQ ID NO:l was obtained by sequencing a cDNA clone, which was deposited 
on January 15, 1 998 at the American Type Culture Collection, 10801 University 
Boulevard, Manassas, Virginia 201 1 0-2209, and given accession number 20958 1 . 
The cDNA clone contained in ATCC Deposit No. 209581 contains a METH1 
sequence, encoding amino acids 1 to 950 of SEQ ID NO:2. 

The present invention also provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a METH2 polypeptide having the amino 
acid sequence shown in SEQ ED NO:4, which was partially determined by 
sequencing a cloned cDNA. The METH2 protein of the present invention shares 
sequence homology with thrombospondin- 1 and pNPI. The nucleotide sequence 
shown in SEQ ID NO:3 was partially obtained by sequencing a cDNA clone, 
which was deposited on January 15, 1998 at the American Type Culture 
Collection, 10801 University Boulevard, Manassas, Virginia 20110-2209, and 
given accession number 209582. The cDNA clone contained in ATCC Deposit 
No. 209582 contains a partial METH2 sequence, encoding amino acids 1 12-890 
of SEQIDNO:4. 
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Nu cleic A cid Molecules 

Some of the nucleotide sequences determined by sequencing a DNA 
molecule herein were determined using an automated DNA sequencer (such as the 
Model 373 from Applied Biosystems, Inc.), and all amino acid sequences of 
polypeptides encoded by DNA molecules determined herein were predicted by 
translation of a DNA sequence determined as above. Therefore, as is known in 
the art for any DNA sequence determined by this automated approach, any 
nucleotide sequence determined herein may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90% identical, 
more typically at least about 95% to at least about 99.9% identical to the actual 
nucleotide sequence of the sequenced -DNA molecule. The actual sequence can - - 
be more precisely determined by other approaches including manual DNA 
sequencing methods well known in the art. As is also known in the art, a single 
insertion or deletion in a determined nucleotide sequence compared to the actual 
sequence will cause a frame shift in translation of the nucleotide sequence such 
that the predicted amino acid sequence encoded by a determined nucleotide 
sequence will be completely different from the amino acid sequence actually 
encoded by the sequenced DNA molecule, beginning at the point of such an 
insertion or deletion. 

Using the information provided herein, such as the nucleotide sequence in 
SEQ ID NO: 1 or SEQ ID NO:3, a nucleic acid molecule of the present invention 
encoding a METH1 or METH2 polypeptide may be obtained using standard 
cloning and screening procedures,.such as those for cloning cDNAs using mRNA 
as starting material. Illustrative of the invention, the nucleic acid molecule 
described in SEQ ED NO: 1 was discovered in a cDNA library derived from human 
heart and the nucleic acid molecule described in SEQ ID NO:3 was discovered in 
a cDNA library derived from human lung. The determined nucleotide sequence 
of the METH1 cDNA of SEQ ID NO: 1 contains an open reading frame encoding 
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a protein of about 950 amino acid residues, including a predicted leader sequence 
of about 28 amino acid residues. The present inventors have determined that the 
nucleotide sequence of the METH2 cDNA of SEQ ID NO:3 contains an open 
reading frame encoding a protein of about 890 amino acid residues, including a 
predicted leader sequence of about 23 amino acid residues. 

The present invention also provides the mature form(s) oftheMETHl and 
METH2 proteins of the present invention. According to the signal hypothesis, 
proteins secreted by mammalian cells have a signal or secretory leader sequence 
which is cleaved from the mature protein once export of the growing protein chain 
across the rough endoplasmic reticulum has been initiated. Most mammalian cells 
and even insect cells cleave secreted proteins with the same specificity. However, 
in some cases, cleavage of a secreted protein is not entirely uniform, which results 
. in two or moreTmature species on the protein. Further, it has long been known 
that the cleavage specificity of a secreted protein is ultimately determined by the 
primary structure of the complete protein, that is, it is inherent in the amino acid 
sequence of the polypeptide. Therefore, the present invention provides a 
nucleotide sequence encoding the mature METH1 polypeptide having the amino 
acid sequence encoded by the cDNA clone contained in the host identified as 
ATCC Deposit No. 209581 and as shown in SEQ ID NO:2. The present 
invention also provides a nucleotide sequence encoding the mature METH2 
polypeptide having the amino acid sequence as shown in SEQ ED NO:4. By the 
mature METH1 protein having the amino acid sequence encoded by the cDNA 
clone contained in the host identified as ATCC Deposit No. 209581 is meant the 
mature form(s) of the METH1 protein produced by expression in a mammalian 
cell (e.g., COS cells, as described below) of the complete open reading frame 
encoded by the human DN A sequence of the clone contained in the vector in the 
deposited host. As indicated below, the mature METH1 having the amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 209581 
may or may not differ from the predicted "mature" METH1 protein shown in SEQ 
ID NO:2 (amino acids from about 29 to about 950) depending on the accuracy of 
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the predicted cleavage site based on computer analysis; and the mature METH2 
may or may not differ from the predicted "mature" METH2 protein shown in SEQ 
H> NO: 4 (amino acids from about 24 to about 890) depending on the accuracy 
of the predicted cleavage site based on computer analysis. 

Methods for predicting whether a protein has a secretory leader as well as 
the cleavage point for that leader sequence are available. For instance, the 
methods of McGeoch (Virus Res. 3:271-286 (1985)) and von Heinje (Nucleic 
Acids Res. 7^:4683-4690 (1986)) can be used. The accuracy of predicting the 
cleavage points of known mammalian secretory proteins for each of these methods 
is in the range of 75-80%. von Heinje, supra. However, the two methods do not 
always produce the same predicted cleavage point(s) for a given protein. 

In the present case, the predicted amino acid sequence of the complete 
- METH1 and-METH2 polypeptides of the present invention were analyzed by-a~ 
computer program ("PSORT*')(K.Nakai andM.Kanehisa, Genomics 74:897-91 1 
(1992)), which is an expert system for predicting the cellular location of a protein 
based on the amino acid sequence. As part of this computational prediction of 
localization, the methods of McGeoch and von Heinje are incorporated. The 
analysis by the PSORT program predicted the cleavage site between amino acids 
28 and 29 in SEQ ID NO:2 and amino acids 23 and 24 in SEQ ID NO:4. 
Thereafter, the complete amino acid sequences were further analyzed by visual 
inspection, applying a simple form of the (-1,-3) rule of von Heinje. von Heinje, 
supra. Thus, the leader sequence for the METH1 protein is predicted to consist 
of amino acid residues from about 1 to about 28 in SEQ ID NO: 2, while the 
mature METH1 protein is predicted to consist of residues from about 29 to about 
950; and the leader sequence for the METH2 protein is predicted to consist of 
amino acid residues from about 1 to about 23 in SEQ ID NO:4, while the mature 
METH2 protein is predicted to consist of residues from about 24 to about 890. 
An alternative predicted mature METH1 protein consists of residues 30 to 950 in 
SEQ ED NO:2. 
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As one of ordinary skill would appreciate, due to the possibilities of 
sequencing errors, as well as the variability of cleavage sites for leaders in different 
known proteins, the predicted METH1 polypeptide encoded by the deposited 
cDNA comprises about 950 amino acids, but may be anywhere in the range of 
910-990 amino acids; and the predicted leader sequence of this protein is about 
28 amino acids, but may be anywhere in the range of about 1 8 to about 38 amino 
acids. Also, the predicted METH2 polypeptide comprises about 890 amino acids, 
but may be anywhere in the range of 850 to about 930 amino acids; and the 
predicted leader sequence of this protein is about 23 amino acids, but may be 
anywhere in the range of about 13 to about 33 amino acids. 

As indicated, nucleic acid molecules of the present invention may be in the 
form of RNA, such as mRNA, or in the form of DNA, including, for instance, 
cDNA and genomic DNA obtained by cloning or produced synthetically. The 
DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA 
may be the coding strand, also known as the sense strand, or it may be the 
non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment. For 
example, recombinant DNA molecules contained in a vector are considered 
isolated for the purposes of the present invention. Further examples of isolated 
DNA molecules include recombinant DNA molecules maintained in heterologous 
host cells or purified (partially or substantially) DNA molecules in solution. 
Isolated RNA molecules include in vivo or in vitro RNA transcripts of the DNA 
molecules of the present invention. Isolated nucleic acid molecules according to 
the present invention further include such molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) shown in SEQ ID NO:l; 
DNA molecules comprising the coding sequence for the mature METH 1 protein; 
and DNA molecules which comprise a sequence substantially different from those 
described above but which, due to the degeneracy of the genetic code, still encode 
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the METH1 protein. Also included are DNA molecules comprising an open 
reading frame (ORF) shown in SEQ ID NO:3; DNA molecules comprising the 
coding sequence for the mature METH2 protein; and DNA molecules which 
comprise a sequence substantially different from those described above but which, 
due to the degeneracy of the genetic code, still encode the METH2 protein. Of 
course, the genetic code is well known in the art. Thus, it would be routine for 
one skilled in the art to generate such degenerate variants. 

In another aspect, the invention provides isolated nucleic acid molecules 
encoding the METH1 or METH2 polypeptides having an amino acid sequence as 
encoded by the cDNA clones contained in the plasmids deposited as ATCC 
Deposit No. 209581 on January 15, 1998 or ATCC Deposit No. 209582 on 
January 1 5, 1 998, respectively. In a further embodiment, nucleic acid molecules 

are provided-encoding the mature METH-1 or METH2 polypeptide or the full- — 

length METH1 or METH2 polypeptide lacking the N-terminal methionine. The 
invention also provides an isolated nucleic acid molecule having the nucleotide 
sequence shown in SEQ ID NO: 1 or SEQ ID NO:3 or the nucleotide sequence of 
theMETHl or METH2 cDN A contained in the above-described deposited clones, 
or a nucleic acid molecule having a sequence complementary to one of the above 
sequences. Such isolated molecules, particularly DNA molecules, are useful as 
probes for gene mapping, by in situ hybridization with chromosomes, and for 
detecting expression of the METH1 or METH2 gene in human tissue, for 
instance, by Northern blot analysis. 

The present invention is further directed to fragments of the isolated 
nucleic acid molecules described herein. By a fragment of an isolated nucleic acid 
molecule having the nucleotide sequence of the deposited cDNA or the nucleotide 
sequence shown in SEQ ID NO: 1 or SEQ ID NO:3 is intended fragments at least 
about 1 5 nt, and more preferably at least about 20 nt, still more,preferably at least 
about 30 nt, and even more preferably, at least about 40 nt in length which are 
useful as diagnostic probes and primers as discussed herein. Of course, larger 
fragments 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 
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750, 800, 850, 900, 950, 1000, 1050, 1 100, 1200, 1300, 1400, 1500, 1600, 1700, 
1800, 1900, 2000, 2100, 2200, 2300, 2400, 2500, 2600, 2700, 2800, 2900, or 
3000 nt in length are also useful according to the present invention as are 
fragments corresponding to most, if not all, of the nucleotide sequence of the 
deposited cDNA or as shown in SEQ ID NO: 1 or SEQ ID NO:3 . By a fragment 
at least 20 nt in length, for example, is intended fragments which include 20 or 
more contiguous bases from the nucleotide sequence of the deposited cDNA or 
the nucleotide sequence as shown in SEQ ID NO;l or SEQ ID NO:3. 

Preferred nucleic acid fragments of the present invention include nucleic 
acid molecules encoding epitope-bearing portions of the METH1 or METH2 
protein. Methods for determining epitope-bearing portions of the METH 1 and 
METH2 proteins are described in detail below. 

Other prejferreinucieic_acid fragments of the present invention, include 

nucleic acid molecules encoding: the metalloprotease domain of METHl, amino 
acids 235 to 459 in SEQ ED NO:2; the disintegrin domain of METHl, amino 
acids 460 to 544 in SEQ ID NO:2; the first TSP-like domain of METH1, amino 
acids 545 to 598 in SEQ ID NO:2; the second TSP-like domain of METHl , amino 
acids 841 to 894 in SEQ ID NO:2; the third TSP-like domain of METHl, amino 
acids 895 to 934 in SEQ ID NO:2; amino acids 536 to 613 in SEQ ID NO:2; 
amino acids 549 to 563 in SEQ ID NO:2; the metalloprotease domain of METH2, 
amino acids 214 to 439 in SEQ ID NO:4; the disintegrin domain of METH2, 
amino acids 440 to 529 in SEQ ID NO:4; the first TSP-like domain of METH2, 
amino acids 530 to 583 in SEQ ID NO:4; the second TSP-like domain of METH2, 
amino acids 837 to 890 in SEQ ID NO:4; amino acids 280 to 606 in SEQ ID 
NO:4; and amino acids 529 to 548 in SEQ ID NO:4. 

In addition, the present inventors have identified the following cDNA 
clones related to portions of the sequence shown in SEQ ID NO: 1 : HOUCQ 1 7RA 
(SEQIDNO:14), HPLBM1 1R (SEQ ID NO:15), HGBI07R (SEQ ED NO:16), 
HNTMA49R (SEQ ID NO: 1 7), HNALE27R (SEQ ID NO: 1 8), and HIBDB45R 
(SEQ ID NO: 19). 
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The following public ESTs, which relate to portions of SEQ ID NO.l, 
have also been identified: D67076 (SEQ ID NO:20), AB001735 (SEQ ID 
NO:21), XI 4787 (SEQ ID NO:22), U64857 (SEQ IDNO:23), X04665 (SEQ ID 
NO:24), M64866 (SEQ ID NO:25), L07803 (SEQ ID NO:26), U08006 (SEQ ID 
5 NO:27), Ml 6974 (SEQ ID NO:28), LI 3855 (SEQ ID NO:29), AL021529 (SEQ 

ID NO:30), D86074 (SEQ ID NO:3 1), L05390 (SEQ ID NO:32), Z69361 (SEQ 
ID NO:33), X99599 (SEQ ID NO:34), AF018073 (SEQ ID NO:35), L23760 
(SEQ ID NO:36), Z46970 (SEQ ID NO:37), AC004449 (SEQ ID NO:38), 
Z69589 (SEQ ID NO:39), Z22279 (SEQ ID NO:40), and X17524 (SEQ ID 

10 NO:41). 

The present inventors have also identified the following cDNA clones 
related to portions of SEQ ID NO:3: HCE4D69FP02 (SEQ ID NO:42), 

HB3DB45F (SEQ ID NO:43), HKIXH64R (SEQ ID NO:44), HIBDB45R (SEQ— 

ID NO: 19), HCE3Z95R (SEQ ID NO:45), HTLEQ90R (SEQ ID NO:46), 

15 HMWEF45R (SEQ ID NO:47), HTOFC34RA (SEQ ID NO:48), HHFDI20R 

(SEQ ID NO:49), HMSHY47R(SEQIDNO:50), HCESF90R (SEQ ED NO:5 1), 
HMCA046R (SEQ ED NO:52), HTTAQ67R (SEQ ID NO:53), HFKCF19F 
(SEQ ID NO:54), HMCAS31R (SEQ ID NO:55), HMWGP26R (SEQ ID 
NO:56), HLHTP36R (SEQ ID NO:57), HE8AN11R (SEQ ID NO:58), 

20 HEONN73R(SEQ ED NO: 59), HBNBG53R (SEQ ID NO:60), and HMSCH94R 

(SEQ ED NO:61). 

The following public ESTs, which relate to portions of the sequence 
shown in SEQ ED NO:3, have also been identified: D67076 (SEQ ED NO:20), 
AB00I735 (SEQ ID NO:21), AB005287 (SEQ ID NO:62), X87619 (SEQ ED 

25 NO:63), X14787 (SEQ ED NO:22), X04665 (SEQ ID NO:24), M87276 (SEQ ED 

NO:64), M62458 (SEQ ED NO:65), AB002364 (SEQ ED NO:66), AB005297 
(SEQ ID NO:67), X69161 (SEQ ED NO:68), XI 6619 (SEQ ED NO:69), 136448 
(SEQ ED NO:70), L12260 (SEQ ID NO:71), 136352 (SEQ ID NO:72), XI 5898 
(SEQ ED NO:73), 107789 (SEQ ID NO:74), 108144 (SEQ ED NO:75) U3 1814 

30 (SEQ ID NO: 76), and AF001444 (SEQ ED NO: 77). 
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In specific embodiments, the polynucleotides of the invention are less than 
300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, or 7.5 kb in length. In a further 
embodiment, polynucleotides of the invention comprise at least 15 contiguous 
nucleotides of METH1 or METH2 coding sequence, but do not comprise all or 
a portion of any METH1 or METH2 intron. In another embodiment, the nucleic 
acid comprising METH1 or METH2 coding sequence does not contain coding 
sequences of a genomic flanking gene (i.e., 5' or 3' to the METH1 or METH2 
gene in the genome); 

In another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a portion of the polynucleotide in a nucleic acid molecule of the 
invention described above, for instance, the cDNA clones contained in ATCC 
Deposit No. 209581 or ATCC Deposit No. 209582. By "stringent hybridization 
conditions" is intended overnight incubation at 42° C in a solution comprising: 
50% formamide, 5x SSC (750 mM NaCl, 75mM trisodium citrate), 50 mM 
sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 
20 ^g/ml denatured, sheared salmon sperm DNA, followed by washing the filters 
in 0.1 x SSC at about 65 °C. 

By a polynucleotide which hybridizes to a "portion" of a polynucleotide 
is intended a polynucleotide (either DNA or RNA) hybridizing to at least about 
1 5 nucleotides (nt), and more preferably at least about 20 nt, still more preferably 
at least about 30 nt, and even more preferably about 30, 40, 50, 60 or 70 nt of the 
reference polynucleotide. These are useful as diagnostic probes and primers as 
discussed above and in more detail below. 

By a portion of a polynucleotide of "at least 20 nt in length," for example, 
is intended 20 or more contiguous nucleotides from the nucleotide sequence of the 
reference polynucleotide (e.g., the deposited cDNAs or the nucleotide sequence 
as shown in SEQ ID NO: 1 or SEQ ED NO:3). Of course, a polynucleotide which 
hybridizes only to a poly A sequence (such as the 3 ' terminal poly(A) tract of the 
METH1 or METH2 cDNA shown in SEQ ID NO:l and SEQ ID NO:3, 



WO 99/37660 



-41- 



PCT/US99/01313 



respectively) or to a complementary stretch of T (or U) resides, would not be 
included in a polynucleotide of the invention used to hybridize to a portion of a 
nucleic acid of the invention, since such a polynucleotide would hybridize to any 
nucleic acid molecule containing a poly (A) stretch or the complement thereof 
5 (e.g., practically any double-stranded cDNA clone). 

Also contemplated are nucleic acid molecules that hybridize to the METH 1 
or METH2 polynucleotides at moderately high stringency hybridization 
conditions. Changes in the stringency of hybridization and signal detection are 
primarily accomplished through the manipulation of formamide concentration 

10 (lower percentages of formamide result in lowered stringency); salt conditions, or 

temperature. For example, moderately high stringency conditions include an 
overnight incubation at 37 degree C in a solution comprising 6X SSPE (20X 
SSPE =-3M-NaCl;--0.2M NaH 2 P0 4 ; 0.02M.EDTA r -pH-3.4), 0.5% SDS, 30% 
formamide, 100 jig/ml salmon sperm blocking DNA; followed by washes at 50° C 

15 with 1 XSSPE, 0.1% SDS. In addition, to achieve even lower stringency, washes 

performed following stringent hybridization can be done at higher salt 
concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through 
the inclusion and/or substitution of alternate blocking reagents used to suppress 

20 background in hybridization experiments. Typical blocking reagents include 

Denhardt f s reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available proprietary formulations. The inclusion of specific 
blocking reagents may require modification of the hybridization conditions 
described above, due to problems with compatibility. 

25 Of course, a polynucleotide which hybridizes only to polyA+ sequences 

(such as any 3 1 terminal polyA+ tract of a cDNA shown in the sequence listing), 
or to a complementary stretch of T (or U) residues, would not be included in the 
definition of "polynucleotide," since such a polynucleotide would hybridize to any 
nucleic acid molecule containing a poly (A) stretch or the complement thereof 

30 (e.g., practically any double-stranded cDNA clone). 
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The METH1 or METH2 polynucleotide can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or 
DNA or modified RNA or DNA. For example, METH1 or METH2 
polynucleotides can be composed of single- and double-stranded DNA, DNA that 
is a mixture of single- and double-stranded regions, single- and double-stranded 
RNA, and RNA that is mixture of single- and double-stranded regions, hybrid 
molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded or a mixture of single- and double-stranded regions. In 
addition, the METH1 or METH2 polynucleotides can be composed of triple- 
stranded regions comprising RNA or DNA or both RNA and DNA. METH1 or 
METH2 polynucleotides may also contain one or more modified bases or DNA 
or RNA backbones modified for stability or for other reasons. "Modified" bases 
include, fqrexample, trit ylated bases and unusual bases such as inosine. A variety 
of modifications can be made to DNA and RNA; thus, "polynucleotide" embraces 
chemically, enzymatically, or metabolically modified forms. 

"SEQ ID NO: 1 11 refers to a METH1 polynucleotide sequence while "SEQ 
ED NO:2" refers to a METH1 polypeptide sequence. "SEQ ID NO:3" refers to 
a METH2 polynucleotide sequence while "SEQ ID NO:4" refers to a METH2 
polypeptide sequence. 

As indicated, nucleic acid molecules of the present invention which encode 
a METH1 or METH2 polypeptide may include, but are not limited to, those 
encoding the amino acid sequence of the mature polypeptide, by itself; the coding 
sequence for the mature polypeptide and additional sequences, such as those 
encoding the leader or secretory sequence, such as a pre-, or pro- or prepro- 
protein sequence; the coding sequence of the mature polypeptide, with or without 
the aforementioned additional coding sequences, together with additional, 
non-coding sequences, including for example, but not limited to introns and 
non-coding 5 ' and 3 ' sequences, such as the transcribed, non-translated sequences 
that play a role in transcription, mRNA processing, including splicing and 
polyadenylation signals, for example - ribosome binding and stability of mRNA; 
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an additional coding sequence which codes for additional amino acids, such as 
those which provide additional functionalities. Thus, the sequence encoding the 
polypeptide may be fused to a marker sequence, such as a sequence encoding a 
peptide which facilitates purification of the fused polypeptide. In certain preferred 
5 embodiments of this aspect of the invention, the marker amino acid sequence is a 

hexa-histidine peptide, such as the tag provided in a pQE vector (Qiagen, Inc.), 
among others, many of which are commercially available. As described in Gentz 
eial. Proa Nail. Acad Sci. USA 56:821-824 (1989), for instance, hexa-histidine 
provides for convenient purification of the fusion protein. The "HA" tag is 

10 another peptide useful for purification which corresponds to an epitope derived 

from the influenza hemagglutinin protein, which has been described by Wilson el 
ai f Cell 37:767-778 (1984). As discussed below, other such fusion proteins 
- - ■ include the-METHl or-METH2 fused to Fc at the N- or-G-terminus. 

The present invention further relates to variants of the nucleic acid 

1 5 molecules of the present invention, which encode portions, analogs or derivatives 

of theMETHl or METH2 protein. Variants may occur naturally, such as a natural 
allelic variant. By an "allelic variant" is intended one of several alternate forms of 
a gene occupying a given locus on a chromosome of an organism. Lewin, B., ed., 
Genes IJ \ John Wiley & Sons, New York (1985). Non-naturally occurring 

20 variants may be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions 
or additions, which may involve one or more nucleotides. The variants may be 
altered in coding regions, non-coding regions, or both. Alterations in the coding 
regions may produce conservative or non-conservative amino acid substitutions, 

25 deletions or additions. Especially preferred among these are silent substitutions, 

additions and deletions, which do not alter the properties and activities of the 
METH1 or METH2 protein or portions thereof. Also especially preferred in this 
regard are conservative substitutions. 

Further embodiments of the invention include isolated nucleic acid 

30 molecules comprising a polynucleotide having a nucleotide sequence at least 95% 
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identical, and more preferably at least 96%, 97%, 98% or 99% identical to: a 
nucleotide sequence encoding the polypeptide having the amino acid sequence in 
SEQ ID NO:2; a nucleotide sequence encoding the polypeptide having the amino 
acid sequence in SEQ ED NO:2, but lacking the N-terminal methionine; a 
nucleotide sequence encoding the polypeptide having the amino acid sequence at 
positions from about 29 to about 950 in SEQ ED NO:2; a nucleotide sequence 
encoding the polypeptide having the amino acid sequence at position from about 
30 to about 950 in SEQ ID NO:2; a nucleotide sequence encoding the polypeptide 
having the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 209581; a nucleotide sequence encoding the mature METH1 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 209581 ; a nucleotide sequence encoding amino 
acids 235 to 459 in SEQ ID NO:2 (the metalloprotease domain of METHl); a - 
nucleotide sequence encoding amino acids 460 to 544 in SEQ ID NO: 2 (the 
disintegrin domain of METHl); a nucleotide sequence encoding amino acids 545 
to 598 in SEQ ID NO:2 (the first TSP-like domain of METHl); a nucleotide 
sequence encoding amino acids 841 to 894 in SEQ ED NO:2 (the second TSP-like 
domain of METHl); a nucleotide sequence encoding amino acids 895 to 934 in 
SEQ ID NO:2 (the third TSP-like domain of METHl); a nucleotide sequence 
encoding amino acids 536 to 613 in SEQ ID NO:2; a nucleotide sequence 
encoding amino acids 549 to 563 in SEQ ED NO:2; a nucleotide sequence 
encoding the polypeptide having the amino acid sequence in SEQ ID NO:4; a 
nucleotide sequence encoding the polypeptide having the amino acid sequence in 
SEQ ED NO:4, but lacking the N-terminal methionine; a nucleotide sequence 
encoding the polypeptide having the amino acid sequence at positions from about 
24 to about 890 in SEQ ED NO:4; a nucleotide sequence encoding the polypeptide 
having the amino acid sequence at positions from about 1 12 to about 890 in SEQ 
ID NO:4; a nucleotide sequence encoding the polypeptide having the amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 209582; 
a nucleotide sequence encoding the mature METH2 polypeptide having the amino 
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acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209582; a nucleotide sequence encoding amino acids 2 1 4 to 439 in SEQ ID NO:4 
(the metalloprotease domain of METH2); a nucleotide sequence encoding amino 
acids 440 to 529 in SEQ ID NO:4 (the disintegrin domain of METH2); a 
nucleotide sequence encoding amino acids 530 to 583 in SEQ ID NO:4 ( the first 
TSP-like domain of METH2); a nucleotide sequence encoding amino acids 837 
to 890 in SEQ ID NO:4 (the second TSP-like domain of METH2); a nucleotide 
sequence encoding amino acids 280 to 606 in SEQ ID NO:4; a nucleotide 
sequence encoding amino acids 529 to 548 in SEQ ID NO:4; or a nucleotide 
sequence complementary to any of the above nucleotide sequences . 

By a polynucleotide having a nucleotide sequence at least, for example, 
95% "identical* 1 to a reference nucleotide sequence encoding a METH1 or 
METH2 polypeptide is— intended- that the nucleotide sequence of the 
polynucleotide is identical to the reference sequence except that the polynucleotide 
sequence may include up to five point mutations per each 1 00 nucleotides of the 
reference nucleotide sequence encoding the METH1 or METH2 polypeptide. In 
other words, to obtain a polynucleotide having a nucleotide sequence at least 95% 
identical to a reference nucleotide sequence, up to 5% of the nucleotides in the 
reference sequence may be deleted or substituted with another nucleotide, or a 
number of nucleotides up to 5% of the total nucleotides in the reference sequence 
may be inserted into the reference sequence. These mutations of the reference 
sequence may occur at the 5' or 3 ' terminal positions of the reference nucleotide 
sequence or anywhere between those terminal positions, interspersed either 
individually among nucleotides in the reference sequence or in one or more 
contiguous groups within the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 95%, 96%, 97%, 98% or 99% identical to, for instance, the nucleotide 
sequence shown in SEQ ID NO: 1 or SEQ ED NO:3 or to the nucleotide sequence 
of the deposited cDNA clones can be determined conventionally using known 
computer programs such as the Bestfit program (Wisconsin Sequence Analysis 
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Package, Version 8 for Unix, Genetics Computer Group, University Research 
Park, 575 Science Drive, Madison, WI 537 11). Bestfit uses the local homology 
algorithm of Smith and Waterman, Advances in Applied Mathematics 2: 482-489 
(1981), to find the best segment of homology between two sequences. When 
using Bestfit or any other sequence alignment program to determine whether a 
particular sequence is, for instance, 95% identical to a reference sequence 
according to the present invention, the parameters are set, of course, such that the 
percentage of identity is calculated over the full length of the reference nucleotide 
sequence and that gaps in homology of up to 5% of the total number of 
nucleotides in the reference sequence are allowed. 

A preferred method for determining the best overall match between a 
query sequence (a sequence of the present invention) and a subject sequence, also 
referred to as a global sequence ^alignment, canbe determined using the FASTDB 
computer program based on the algorithm of Brutlag et aL, Comp, Appl. BioscL 
6:231-245 (1990). In a sequence alignment, the query and subject sequences are 
both DNA sequences. An RNA sequence can be compared by converting U's to 
T f s. The result of said global sequence alignment is in percent identity. Preferred 
parameters used in a FASTDB alignment of DNA sequences to calculate percent 
identity are; Matrix=Unitary, k-tuple=4, Mismatch Penalty= 1 , Joining Penalty=3 0, 
Randomization Group Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size 
Penalty=0.05, Window Size=500 or the length of the subject nucleotide sequence, 
whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5* or 
3' deletions, not because of internal deletions, a manual correction must be made 
to the results. This is because the FASTDB program does not account for 5' and 
3 1 truncations of the subject sequence when calculating percent identity. For 
subject sequences truncated at the 5' or 3' ends, relative to the query sequence, the 
percent identity is corrected by calculating the number of bases of the query 
sequence that are 5' and 3* of the subject sequence, which are not matched/aligned, 
as a percent of the total bases of the query sequence. Whether a nucleotide is 
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matched/aligned is determined by the results of the FASTDB sequence alignment. 
This percentage is then subtracted from the percent identity, calculated by the 
above FASTDB program using the specified parameters, to arrive at a final 
percent identity score. This corrected score is what is used for the purposes of the 
present invention. Only bases outside the 5' and 3' bases of the subject sequence, 
as displayed by the FASTDB alignment, which are not matched/aligned with the 
query sequence are calculated for the purposes of manually adjusting the percent 
identity score. 

For example, a 90 base subject sequence is aligned to a 1 00 base query 
sequence to determine percent identity. The deletions occur at the 5' end of the 
subject sequence and, therefore, the FASTDB alignment does not show a 
match/alignment of the first 10 bases at the 5' end. The 10 unpaired bases 
represent 10%~of~the sequence (number of bases at the 5' and 3* ends not- 
matched/total number of bases in the query sequence), so 1 0% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 
90 bases were perfectly matched the final percent identity would be 90%. In 
another example, a 90 base subject sequence is compared with a 100 base query 
sequence. This time the deletions are internal, so that there are no bases on the 
5' or 3* ends of the subject sequence which are not matched/aligned with the query. 
In this case, the percent identity calculated by FASTDB is not manually corrected. 
One again, only bases 5 1 and 3* of the subject sequence which are not 
matched/aligned with the query sequence are manually corrected for. No other 
manual corrections are to be made for the purposes of the present invention. 

The present application is directed to nucleic acid molecules at least 95%, 
96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in SEQ ED 
NO:l or SEQ ID NO:3 or to the nucleic acid sequence of the deposited cDNAs, 
irrespective of whether they encode a polypeptide having METH1 or METH2 
activity. This is because even where a particular nucleic acid molecule does not 
encode a polypeptide having METH1 or METH2 activity, one of skill in the art 
would still know how to use the nucleic acid molecule, for instance, as a 
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hybridization probe or a polymerase chain reaction (PCR) primer. Uses of the 
nucleic acid molecules of the present invention that do not encode a polypeptide 
having METH1 or METH2 activity include, inter alia, (!) isolating the METH1 
or METH2 gene or allelic variants thereof in a cDNA library; (2) in situ 
hybridization (e.g., "FISH") to metaphase chromosomal spreads to provide precise 
chromosomal location of the METH1 or METH2 gene, as described in Verma et 
ah, Human Chromosomes: A Manual of Basic Techniques^ Pergamon Press, New 
York (1988); and (3) Northern Blot analysis for detecting METH1 or METH2 
mRNA expression in specific tissues. 

Preferred, however, are nucleic acid molecules having sequences at least 
95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in 
SEQ ID NO:l or SEQ ED NO:3 or to a nucleic acid sequence of the deposited 
cDNAs which do, in fact, encode a polypeptide having METH1 or METH2 
protein activity. By "a polypeptide having METH1 activity" is intended 
polypeptides exhibiting METH1 activity in a particular biological assay. For 
example, METH1 protein activity can be measured using the chorioallantoic 
membrane assay (Iruela-Arispe et aL, Thrombosis and Haemostasis 78(1) :612- 
677 (1997)) or the cornea pocket assay (Tolsmae/a/., J, Cell Biol. 722:497-5 1 1 
(1993)), both described in Example 4, below. By "a polypeptide having METH2 
activity" is intended polypeptides exhibiting METH2 activity in a particular 
biological assay. For example, METH2 protein activity can also be measured 
using the chorioallantoic membrane assay (Iruela-Arispe et al. t Thrombosis and 
Haemostasis 78(1) : 672-677 ( 1 997)) or the cornea pocket assay (Tolsma et al , J. 
Cell Biol 722:497-51 1 (1993)), both described in Example 4, below. 

Briefly, in the chorioallantoic assay, the potentially anti-angiogenic 
compound of interest is added to type I collagen pellets (Vitrogen), along with an 
angiogenic growth factor, such as bFGF. The samples are mixed and placed onto 
nylon meshes, and allowed to polymerize. After polymerization is complete, the 
meshes are placed onto the chorioallantoic membrane of 12 day old chick embryos 
and placed at 37 °C for 24 hours. The embryos then injected with a fluorescent 
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agent, such as FITC-dextran, and the meshes are fixed and mounted for 
observation under a fluorescent microscope. 

■ In the cornea pocket assay, hydron pellets containing the compound of 
interest and an angiogenic growth factor, such as bFGF, are implanted 1 to 2mm 
from the limbus of the cornea of rats or mice. Response is examined after a period 
of time, for example 5 days. The extent of angiogenesis is evaluated by measuring 
the capillaries migrating from the limb of the cornea. 

Of course, due to the degeneracy of the genetic code, one of ordinary skill 
in the art will immediately recognize that a large number of the nucleic acid 
molecules having a sequence at least 95%, 96%, 97%, 98%, or 99% identical to 
a nucleic acid sequence of the deposited cDNAs or a nucleic acid sequence shown 
in SEQ ID NO: 1 or SEQ ID NO:3 will encode a polypeptide "having METH1 or 
METH2 protein activity. " In fact, since degenerate variants of these nucleotide 
sequences all encode the same polypeptide, this will be clear to the skilled artisan 
even without performing the above described comparison assay. It will be further 
recognized in the art that, for such nucleic acid molecules that are not degenerate 
variants, a reasonable number will also encode a polypeptide having METH1 or 
METH2 protein activity. This is because the skilled artisan is fully aware of amino 
acid substitutions that are either less likely or not likely to significantly effect 
protein function (e.g., replacing one aliphatic amino acid with a second aliphatic 
amino acid). 

For example, guidance concerning how to make phenotypically silent 
amino acid substitutions is provided in Bowie, J. U. et al. t "Deciphering the 
Message in Protein Sequences: Tolerance to Amino Acid Substitutions, 11 Science 
24 7: 1 3 06- 1310(1 990), wherein the authors indicate that proteins are surprisingly 
tolerant of amino acid substitutions. 
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Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of METH1 or 
METH2 polypeptides or fragments thereof by recombinant techniques. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in a 
precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 
appropriate packaging cell line and then transduced into host cells. 

The DNA insert should be operatively linked to an appropriate promoter, 
s V*ph ?§Jhe pha ge lambda PL promoter, the E. coli lac, trp and tac promoters, the _i 
S V40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 
constructs will further contain sites for transcription initiation, termination and, in 
the transcribed region, a ribosome binding site for translation. The coding portion 
of the mature transcripts expressed by the constructs will preferably include a 
translation initiating at the beginning and a termination codon (UAA, VGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture and tetracycline or ampicillin resistance genes 
for culturing in E. coli and other bacteria. Representative examples of appropriate 
hosts include, but are not limited to, bacterial cells, such as E. coli, Streptomyces 
and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect cells 
such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS 
and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 
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Among vectors preferred for use in bacteria include pQE70, pQE60 and 
pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript 
vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from Stratagene; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRITS available from Pharmacia. 
Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl 
and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL 
available from Pharmacia. Other suitable vectors will be readily apparent to the 
skilled artisan. 

In addition to the use of expression vectors in the practice of the present 
invention, the present invention further includes novel expression vectors 
comprising operator and promoter elements operatively linked to nucleotide 
sequences encoding a protein of interest. One example of such a vector is pHE4-5 
— which is described in detail below. — 

As summarized in Figures 8 and 9, components of the pHE4-5 vector 
(SEQ ID NO: 12) include: 1) a neomycinphosphotransferase gene as a selection 
marker, 2) an E. coli origin of replication, 3) a T5 phage promoter sequence, 
4) two lac operator sequences, 5) a Shine-Delgarno sequence, 6) the lactose 
operon repressor gene (laclq). The origin of replication (oriC) is derived from 
pUC19 (LTI, Gaithersburg, MD). The promoter sequence and operator 
sequences were made synthetically. Synthetic production of nucleic acid 
sequences is well known in the art. CLONTECH 95/96 Catalog, pages 215-216, 
CLONTECH, 1020 East Meadow Circle, Palo Alto, CA 94303. A nucleotide 
sequence encoding METH1 (SEQ ID NO:2) or METH2 (SEQ ID NO:4), is 
operatively linked to the promoter and operator by inserting the nucleotide 
sequence between the Ndel and Asp71 8 sites of the pHE4-5 vector. 

As noted above, the pHE4-5 vector contains a laclq gene. Laclq is an 
allele of the lacl gene which confers tight regulation of the lac pperator. Amann, 
E. et al t Gene 69:301-315 (1988); Stark, M, Gene 57:255-267 (1987). The 
laclq gene encodes a repressor protein which binds to lac operator sequences and 
blocks transcription of down-stream (/.e., 3') sequences. However, the laclq gene 
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product dissociates from the lac operator in the presence of either lactose or 
certain lactose analogs, e.g., isopropyl B-D-thiogalactopyranoside (IPTG). 
METH1 or METH2 thus is not produced in appreciable quantities in uninduced 
host cells containing the pHE4-5 vector. Induction of these host cells by the 
addition of an agent such as IPTG, however, results in the expression of the 
METH1 or METH2 coding sequence. 

The promoter/operator sequences of the pHE4-5 vector (SEQ ED NO: 13) 
comprise a T5 phage promoter and two lac operator sequences. One operator is 
located 5* to the transcriptional start site and the other is located 3* to the same 
site. These operators, when present in combination with the laclq gene product, 
confer tight repression of down-stream sequences in the absence of a lac operon 
inducer, e.g., IPTG. Expression of operatively linked sequences located down- 
stream J"romjthe7ac operators may be induced, by the addition .of a lac operotn. 
inducer, such as IPTG. Binding of a lac inducer to the laclq proteins results in 
their release from the lac operator sequences and the initiation of transcription of 
operatively linked sequences. Lac operon regulation of gene expression is 
reviewed in Devlin, T., Textbook of Biochemistry with Clinical 
Correlations, 4th Edition (1997), pages 802-807. 

The pHE4 series of vectors contain all of the components of the pHE4-5 
vector except for the METH1 or METH2 coding sequence. Features of the pHE4 
vectors include optimized synthetic T5 phage promoter, lac operator, and 
Shine-Delgarno sequences. Further, these sequences are also optimally spaced so 
that expression of an inserted gene may be tightly regulated and high level of 
expression occurs upon induction. 

Among known bacterial promoters suitable for use in the production of 
proteins of the present invention include the E. coli lacl and lacZ promoters, the 
T3 and T7 promoters, the gpi promoter, the lambda PR and PL promoters and the 
trp promoter. Suitable eukaryotic promoters include the CMV immediate early 
promoter, the HS V thymidine kinase promoter, the early and late SV40 
promoters, the promoters of retroviral LTRs, such as those of the Rous Sarcoma 
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Virus (RS V), and metallothionein promoters, such as the mouse metallothionein-I 
promoter. 

The pHE4-5 vector also contains a Shine-Delgarno sequence 5' to the 
AUG initiation codon. Shine-Delgarno sequences are short sequences generally 
located about 10 nucleotides up-stream (i.e., 5*) from the AUG initiation codon. 
These sequences essentially direct prokaryotic ribosomes to the AUG initiation 
codon. 

Thus, the present invention is also directed to expression vector useful for 
the production of the proteins of the present invention. This aspect of the 
invention is exemplified by the pHE4-5 vector (SEQ ID NO: 12). 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic 
lipid-mediated transfection, electroporation, transduction, infection or other 
methods. Such methods are described in many standard laboratory manuals, such 
as Davis et al t Basic Methods In Molecular Biology (1 986). 

The polypeptide may be expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional 
heterologous functional regions! For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties 
may be added to the polypeptide to facilitate purification. Such regions may be 
removed prior to final preparation of the polypeptide. The addition of peptide 
moieties to polypeptides to engender secretion or excretion, to improve stability 
and to facilitate purification, among others, are familiar and routine techniques in 
the art. A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize proteins. For example, EP-A-O 464 
533 (Canadian counterpart 2045869) discloses fusion proteins comprising various 
portions of constant region of immunoglobin molecules together with another 
human protein or part thereof. In many cases, the Fc part in a fusion protein is 
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thoroughly advantageous for use in therapy and diagnosis and thus results, for 
example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 
hand, for some uses it would be desirable to be able to delete the Fc part after the 
fusion protein has been expressed, detected and purified in the advantageous 
manner described. This is the case when the Fc portion proves to be a hindrance 
to use in therapy and diagnosis, for example when the fusion protein is to be used 
as an antigen for immunizations. In drug discovery, for example, human proteins, 
such as the hIL5-receptor, have been fused with Fc portions for the purpose of 
high-throughput screening assays to identify antagonists of hIL-5. See, D. 
Bennett et aL, J. Mot Recognition 5:52-58 (1995) and K. Johanson e/ a/., J. of 
Biol Chem. 2 70(1 6) :9459-9471 (1995). 

The METH1 or METH2 protein can be recovered and purified from 
recombinant cell cultures by-well-known methods including ammonium sulfate or 
ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography 
("HPLC") is employed for purification. Polypeptides of the present invention 
include naturally purified products, products of chemical synthetic procedures, and 
products produced by recombinant techniques from a prokaryotic or eukaryotic 
host, including, for example, bacterial, yeast, higher plant, insect and mammalian 
cells. Depending upon the host employed in a recombinant production procedure, 
the polypeptides of the present invention may be glycosylated or may be 
non-glycosylated. In addition, polypeptides of the invention may also include an 
initial modified methionine residue, in some cases as a result of host-mediated 
processes. 
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METH1 and METH2 Polypeptides and Fragments 

The invention further provides an isolated METH1 polypeptide having the 
amino acid sequence encoded by the deposited cDNA, or the amino acid sequence 
in SEQ ID NO:2, or a peptide or polypeptide comprising a portion of the above 
polypeptides. The invention also provides an isolated METH2 polypeptide having 
the amino acid sequence encoded by the deposited cDNA, or the amino acid 
sequence in SEQ ID NO:4, or a peptide or polypeptide comprising a portion of 
the above polypeptides. 

METH1 or METH2 polypeptides can be composed of amino acids joined 
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, 
and may contain amino acids other than the 20 gene-encoded amino acids. The 
METHl or METH2 polypeptides may 6e"modified by either natural processes, 
such as posttranslational processing, or by chemical modification techniques which 
are well known in the art. Such modifications are well described in basic texts and 
in more detailed monographs, as well as in a voluminous research literature. 
Modifications can occur anywhere in the METHl or METH2 polypeptide, 
including the peptide backbone, the amino acid side-chains and the amino or 
carboxyl termini. It will be appreciated thaj the same type of modification may be 
present in the same or varying degrees at several sites in a given METHl or 
METH2 polypeptide. Also, a given METHl or METH2 polypeptide may contain 
many types of modifications. METHl orMETH2 polypeptides may be branched 
, for example, as a result of ubiquitination, and they may be cyclic, with or without 
branching. Cyclic, branched, and branched cyclic METHl or METH2 
polypeptides may result from posttranslation natural processes or may be made by 
synthetic methods. Modifications include acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachment of a 
heme moiety, covalent attachment of a nucleotide or nucleotide derivative, 
covalent attachment of a lipid or lipid derivative, covalent attachment of 
phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, 
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demethylation, formation of covalent cross-links, formation of cysteine, formation 
of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor 
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, 
pegylation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-KNA mediated addition of amino acids to 
proteins such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, 
W. H. Freeman and Company, New York (1993); POSTTRANSLATIONAL 
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic 
Press, New York, pgs. 1-12 (1983); Seifter et al 3 Meth Enzymol 752:626-646 
(1990); Rattan etal, Ann NY Acad Sci 663:48-62 (1992).) 

It will be recognized in the art that some amino acid sequences of the 
METH-l-and-METH2-polypeptides can be varied without significant effect of the 
structure or function of the protein. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the 
protein which determine activity. 

The present inventors have shown that METH1 and METH2 inhibit 
angiogenesis in vitro and in vivo. METH1 and METH2 each contain a 
metalloprotease domain, a disintegrin domain, and TSP-like domains. The 
metalloprotease domain may be catalytically active. The disintegrin domain may 
play a role in inhibiting angiogenesis by interacting with integrins, since integrins 
are essential for the mediation of both proliferative and migratory signals. The 
present inventors have shown that peptides derived from the TSP-like domains of 
METH1 and METH2 inhibit angiogenesis in vitro and in vivo. 

Thus, the invention further includes variations of the METH 1 polypeptide 
which show substantial METH1 polypeptide activity or which include regions of 
METH1 protein such as the protein portions discussed below; and variations of 
the METH2 polypeptide which show substantial METH2 polypeptide activity or 
which include regions of METH2 protein such as the protein portions discussed 
below. Such mutants include deletions, insertions, inversions, repeats, and type 
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substitutions. As indicated above, guidance concerning which amino acid changes 
are likely to be phenotypically silent can be found in Bowie, J.U., et aL, 
"Deciphering the Message in Protein Sequences: Tolerance to Amino Acid 
Substitutions," Science 247:1306-1310 (1990). 

Thus, the fragment, derivative or analog of the polypeptide of SEQ ID 
NO:2 or SEQ ID NO:4, or that encoded by the deposited cDNA, may be (i) one 
in which one or more of the amino acid residues are substituted with a conserved 
or non-conserved amino acid residue (preferably a conserved amino acid residue) 
and such substituted amino acid residue may or may not be one encoded by the 
genetic code, or (ii) one in which one or more of the amino acid residues includes 
a substituent group, or (iii) one in which the mature polypeptide is fused with 
another compound, such as a compound to increase the half-life of the polypeptide, 
(for example, polyethylene glycol), or (iv) one in which the additional amino acids 
are fused to the mature polypeptide, such as an IgG Fc fusion region peptide or 
leader or secretory sequence or a sequence which is employed for purification of 
the mature polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogs are deemed to be within the scope of those skilled in the art from the 
teachings herein. 

Of particular interest are substitutions of charged amino acids with another 
charged amino acid and with neutral or negatively charged amino acids. The latter 
results in proteins with reduced positive charge to improve the characteristics of 
the METH1 or METH2 proteins. The prevention of aggregation is highly 
desirable. Aggregation of proteins not only results in a loss of activity but can also 
be problematic when preparing pharmaceutical formulations, because they can be 
immunogenic. (Pinckard etaL, Clin. Exp. Immunol. 2:331-340 (1967); Robbins 
etal. t Diabetes 35:838-845 (1987); Cleland et al. t Crit. Rev. Therapeutic Drug 
Carrier Systems 70:307-377 (1993)). 

As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding 
or activity of the protein (see Table 3). 



WO 99/37660 



PCT/US99/01313 



-58- 



TABLE 3. Conservative Amino Acid Substitutions. 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


xiy u ropnoui u 


Leucine 




TQnlf*nr:iriP 

ldUlvUwlliw 




Valine 


Polar 


Glutamine 




Asparagine 


Basic 


Arginine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 


j— 


Threonine 




Methionine 




Glycine 



Of course, the number of amino acid substitutions a skilled artisan would 
make depends on many factors, including those described above. Generally 

10 speaking, the number of amino acid substitutions for any given METH1 or 

METH2 polypeptide will not be more than 50, 40, 30, 20, 10, 5, or 3. 

Amino acids in the METH1 and METH2 proteins of the present invention 
that are essential for function can be identified by methods known in the art, such 
as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and 

15 Wells, Science 244\ 108 1-1 085 (1 989)). The latter procedure introduces single 

alanine mutations at every residue in the molecule. The resulting mutant 
molecules are then tested for biological activity such as in vitro or in vivo 
inhibition of angiogenesis. Sites that are critical for inhibition of angiogenesis can 
also be determined by structural analysis such as crystallization, nuclear magnetic 
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resonance or photoaffinity labeling (Smith etal, J. Mol Biol. 224:899-904 (1992) 
and de Vos et ai, Science 255:306-3 12 (1992)). 

The polypeptides of the present invention are preferably provided in an 
isolated form. By "isolated polypeptide" is intended a polypeptide removed from 
5 its native environment. Thus, a polypeptide produced and/or contained within a 

recombinant host cell is considered isolated for purposes of the present invention. 
Also intended as an "isolated polypeptide" are polypeptides that have been 
purified, partially or substantially, from a recombinant host cell or from a native 
source. For example, a recombinantly produced version of the METH1 or 

10 METH2 polypeptide can be substantially purified by the one-step method 

described in Smith and Johnson, Gene 57:31-40 (1988). 

The polypeptides of the present invention include the METH1 polypeptide 
encoded by the deposited cDNAlnduding the leader; the mature METH1 
polypeptide encoded by the deposited the cDN A minus the leader (i.e., the mature 

15 protein); a polypeptide comprising amino acids about 1 to about 950 in SEQ ID 

NO:2; a polypeptide comprising amino acids about 2 to about 950 in SEQ ED 
NO:2; a polypeptide comprising amino acids about 29 to about 950 in SEQ ID 
NO:2; a polypeptide comprising amino acids about 30 to about 950 in SEQ ID 
NO:2; a polypeptide comprising the metalloprotease domain of METH1, amino 

20 acids 235 to 459 in SEQ ID NO:2; a polypeptide comprising the disintegrin 

domain of METH1, amino acids 460 to 544 in SEQ ID NO:2; a polypeptide 
comprising the first TSP-Iike domain of METH1, amino acids 545 to 598 in SEQ 
ID NO:2; a polypeptide comprising the second TSP-like domain of METH1, 
amino acids 84 1 to 894 in SEQ ID NO:2; a polypeptide comprising the third TSP- 

25 like domain of METH1 , amino acids 895 to 934 in SEQ ID NO:2; a polypeptide 

comprising amino acids 536 to 613 in SEQ ID NO:2; a polypeptide comprising 
amino acids 549 to 563 in SEQ ID NO:2; the METH2 polypeptide encoded by the 
deposited cDNA including the leader; the mature METH2 polypeptide encoded 
by the deposited the cDNA minus the leader (i.e., the mature protein); a 

30 polypeptide comprising amino acids about 1 to about 890 in SEQ ID NO:4; a 
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polypeptide comprising amino acids about 2 to about 890 in SEQ ID NO:4, a 
polypeptide comprising amino acids about 24 to about 890 in SEQ ED NO:4; a 
polypeptide comprising amino acids about 1 12 to about 890 in SEQ ED NO:4; a 
polypeptide comprising the metalloprotease domain of METH2, amino acids 214 
to 439 in SEQ ID NO:4; a polypeptide comprising the disintegrin domain of 
METH2, amino acids 440 to 529 in SEQ ED NO:4; a polypeptide comprising the 
first TSP-Iike domain of METH2, amino acids 530 to 583 in SEQ ID NO:4; a 
polypeptide comprising the second TSP-like domain of METH2, amino acids 837 
to 890 in SEQ ID NO:4; a polypeptide comprising amino acids 280 to 606 in SEQ 
ID NO:4; a polypeptide comprising amino acids 529 to 548 in SEQ EE) NO:4; as 
well as polypeptides which are at least 95% identical, and more preferably at least 
96%, 97%, 98% or 99% identical to the polypeptides described above ancj, also 
include portions^of such polypeptides with at least 30 amino acids and more" 
preferably at least 50 amino acids. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a reference amino acid sequence of a METH1 or METH2 
polypeptide is intended that the amino acid sequence of the polypeptide is identical 
to the reference sequence except that the polypeptide sequence may include up to 
five amino acid alterations per each 100 amino acids of the reference amino acid 
of the METH1 or METH2 polypeptide. In other words, to obtain a polypeptide 
having an amino acid sequence at least 95% identical to a reference amino acid 
sequence, up to 5% of the amino acid residues in the reference sequence may be 
deleted or substituted with another amino acid, or a number of amino acids up to 
5% of the total amino acid residues in the reference sequence may be inserted into 
the reference sequence. These alterations of the reference sequence may occur at 
the amino or carboxy terminal positions of the reference amino acid sequence or 
anywhere between those terminal positions, interspersed either individually among 
residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 
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As a practical matter, whether any particular polypeptide is at least 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequence shown 
in SEQ ID NO:2 or SEQ ID NO:4 or to the amino acid sequence encoded by 
deposited cDN A clones can be determined conventionally using known computer 
programs such the Bestfit program (Wisconsin Sequence Analysis Package, 
Version 8 for Unix, Genetics Computer Group, University Research Park, 575 
Science Drive, Madison, WI 5371 1). When using Bestfit or any other sequence 
alignment program to determine whether a particular sequence is, for instance, 
95% identical to a reference sequence according to the present invention, the 
parameters are set, of course, such that the percentage of identity is calculated 
over the full length of the reference amino acid sequence and that gaps in 
homology of up to 5% of the total number of amino acid residues in the reference 
sequence are allowed. 

A preferred method for determining the best overall match between a 
query sequence (a sequence of the present invention) and a subject sequence, also 
referred to as a global sequence alignment, can be determined using the FASTDB 
computer program based on the algorithm of Brutlag et al. y Comp. App. Biosci. 
6:237-245 (1990). In a sequence alignment, the query and subject sequences are 
either both nucleotide sequences or both amino acid sequences. The result of said 
global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch 
Penalty=l , Joining Penalty=20, Randomization Group Length=0, Cutoff Score= 1 , 
Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the 
subject amino acid sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
terminal deletions, not because of internal deletions, a manual correction must be 
made to the results. This is because the FASTDB program does not account for 
N- and C-terminal truncations of the subject sequence when calculating global 
percent identity. For subject sequences truncated at the N- and C-termini, relative 
to the query sequence, the percent identity is corrected by calculating the number 
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of residues of the query sequence that are N- and C-terminal of the subject 
sequence, which are not matched/aligned with a corresponding subject residue, as 
a percent of the total residues of the query sequence. Whether a residue is 
matched/aligned is determined by the results of the FASTDB sequence alignment. 
This percentage is then subtracted from the percent identity, calculated by the 
above FASTDB program using the specified parameters, to arrive at a final 
percent identity score. This final percent identity score is what is used for the 
purposes of the present invention. Only residues to the N- and C-termini of the 
subject sequence, which are not matched/aligned with the query sequence, are 
considered for the purposes of manually adjusting the percent identity score. That 
is, only query residue positions outside the farthest N- and C-terminal residues of 
the subject sequence. 

For example; a 90 amino acid residue subject sequence is aligned with a 
1 00 residue query sequence to determine percent identity. The deletion occurs at 
the N-terminus of the subject sequence and therefore, the FASTDB alignment 
does not show a match/alignment of the first 10 residues at the N-terminus. The 
1 0 unpaired residues represent 1 0% of the sequence (number of residues at the N- 
and C-termini not matched/total number of residues in the query sequence) so 
10% is subtracted from the percent identity score calculated by the FASTDB 
program. If the remaining 90 residues were perfectly matched, the final percent 
identity would be 90%. In another example, a 90 residue subject sequence is 
compared with a 1 00 residue query sequence. This time, the deletions are internal, 
so there are no residues at the N- or C-termini of the subject sequence which are 
not matched/aligned with the query. In this case, the percent identity calculated 
by FASTDB is not manually corrected. Once again, only residue positions outside 
the N- and C-terminal ends of the subject sequence, as displayed in the FASTDB 
alignment, which are not matched/aligned with the query sequence are manually 
corrected for. No other manual corrections are made for the purposes of the 
present invention. 
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The polypeptides of the present invention are useful as a molecular weight 
marker on SDS-PAGE gels or on molecular sieve gel filtration columns using 
methods well known to those of skill in the art. 

In another aspect, the invention provides a peptide or polypeptide 
5 comprising an epitope-bearing portion of a polypeptide of the invention. The 

epitope of this polypeptide portion is an immunogenic or antigenic epitope of a 
polypeptide described herein. An "immunogenic epitope" is defined as a part of 
a protein that elicits an antibody response when the whole protein is the 
immunogen. On the other hand, a region of a protein molecule to which an 
10 antibody can bind is defined as an "antigenic epitope." The number of 

immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, for instance, Geysen et ai, Proc. Nail Acad. Sci. USA 8J:399S r 
4002(1983). ~~ — 

As to the selection of peptides or polypeptides bearing an antigenic epitope 
15 (i.e., that contain a region of a protein molecule to which an antibody can bind), 

it is well known in that art that relatively short synthetic peptides that mimic part 
of a protein sequence are routinely capable of eliciting an antiserum that reacts 
with the partially mimicked protein. See, for instance, Sutcliffe, J. G. et ai y 
"Antibodies that react with predetermined sites on proteins", Science 2/9:660-666 
20 (1983). Peptides capable of eliciting protein-reactive sera are frequently 

represented in the primary sequence of a protein, can be characterized by a set of 
simple chemical rules, and are confined neither to immunodominant regions of 
intact proteins (i.e., immunogenic epitopes) nor to the amino or carboxyl 
terminals. 

25 Antigenic epitope-bearing peptides and polypeptides of the invention are 

therefore useful to raise antibodies, including monoclonal antibodies, that bind 
specifically to a polypeptide of the invention. See, for instance, Wilson et aL, Cell 
37:167-11% (1984) at 777. 

Antigenic epitope-bearing peptides and polypeptides of the invention 

30 preferably contain a sequence of at least seven, more preferably at least nine and 



WO 99/37660 



-64- 



most preferably between about at least about 15 to about 30 amino acids 
contained within the amino acid sequence of a polypeptide of the invention. 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means. Houghten, R. A., "General method for the 
rapid solid-phase synthesis of large numbers of peptides: specificity of 
antigen-antibody interaction at the level of individual amino acids", Proc. Nail. 
Acad. ScL USA 52:5131-5135 (1985). This "Simultaneous Multiple Peptide 
Synthesis (SMPS)" process is further described in U.S. Patent No. 4,63 1 ,21 1 to 
Houghten et ai (1986). 

As one of skill in the art will appreciate, METH1 or METH2 polypeptides 
of the present invention and the epitope-bearing fragments thereof described 
above can be combined with parts of the constant domain of immunoglobulins 
(IgG) - resulting in -chimeric polypeptides. These fusion proteins facilitate 
purification and show an increased half-life in vivo. This has been shown, e.g., for 
chimeric proteins consisting of the first two domains of the human 
CD4-polypeptide and various domains of the constant regions of the heavy or light 
chains of mammalian immunoglobulins (EPA 394,827; Traunecker et al. t Nature 
537:84- 86 (1988)). Fusion proteins that have a disulfide-linked dimeric structure 
due to the IgG part can also be more efficient in binding and neutralizing other 
molecules than the monomelic METH1 or METH2 protein or protein fragment 
alone (Fountoulakis etal, J. Biochem, 270:3958-3964 (1995)). 

METH1 and METH2 Polynucleotide and Polypeptide Fragments 

In the present invention, a "polynucleotide fragment" refers to a short 
polynucleotide having a nucleic acid sequence contained in the deposited clones 
or shown in SEQ ED NO: 1 or SEQ ID NO:3 . The short nucleotide fragments are 
preferably at least about 1 5 nt, and more preferably at least about 20 nt, still more 
preferably at least about 30 nt, and even more preferably, at least about 40 nt in 
length. A fragment "at least 20 nt in length," for example, is intended to include 
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20 or more contiguous bases from the cDN A sequence contained in the deposited 
clones or the nucleotide sequence shown in SEQ ED NO:l or SEQ ID NO:3. 

These nucleotide fragments are useful as diagnostic probes and primers as 

) 

discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 2000 
nucleotides) are preferred. 

Moreover, representative examples ofMETHl orMETH2 polynucleotide 
fragments include, for example, fragments having a sequence from about 
nucleotide number 1 -50, 5 1 - 1 00, 1 0 1 - 1 50, 1 5 1 -200, 20 1 -250, 25 1 -3 00, 30 1 -350, 
351-400,401-450, 451-500,501-550,551-600,651-700, 701-750, 751-800, 800- 
850, 851-900, 901-950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 1151- 
1200, 1201-1250, 1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 

1501-1550, 1551-1600, 1601-1650,-1651-1700, 1701-1750, 1751-1800, 1801- 

185071851-796071901-1950, 1951-2000, or 206Tto the end of SEQ ID NO:l ~~ 
or SEQ ID NO: 3 or the cDNA contained in the deposited clones. In this context 
"about" includes the particularly recited ranges, larger or smaller by several (5, 4, 
3, 2, or 1) nucleotides, at either terminus or at both termini. Preferably, these 
fragments encode a polypeptide which has biological activity. More preferably, 
these polynucleotides can be used as probes or primers as discussed herein. 

In the present invention, a "polypeptide fragment" refers to a short amino 
acid sequence contained in SEQ ID NO:2 or SEQ ID NO:4 or encoded by the 
cDNA contained in the deposited clones. Protein fragments may be "free- 
standing," or comprised within a larger polypeptide of which the fragment forms 
a part or region, most preferably as a single continuous region. Representative 
examples of polypeptide fragments of the invention, include, for example, 
fragments from about amino acid number 1 -20, 2 1 -40, 4 1 -60, 6 1 -80, 8 1 - 1 00, 1 02- 
120, 121-140, 141-160, 161-180, 181-200,201-220,221-240,241-260,261-280, 
or 281 to the end of the coding region or SEQ ID NO:2 pr SEQ ID NO:4. 
Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 1 00, 
1 1 0, 1 20, 130, 1 40, or 1 50 amino acids in length. In this context "about" includes 
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the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1 ) amino 
acids, at either extreme or at both extremes. 

Preferred polypeptide fragments include the secreted METH 1 or METH2 
protein as well as the mature form. Further preferred polypeptide fragments 
include the secreted METH1 or METH2 protein or the mature form having a 
continuous series of deleted residues from the amino or the carboxy terminus, or 
both. For example, any number of amino acids, ranging from 1-60, can be deleted 
from the amino terminus of either the secreted METH1 or METH2 polypeptide 
or the mature form. Similarly, any number of amino acids, ranging from 1 -30, can 
be deleted from the carboxy terminus of the secreted METH1 or METH2 protein 
or mature form. Furthermore, any combination of the above amino and carboxy 
terminus deletions are preferred. Similarly, polynucleotide fragments encoding 
these METH1 or METH2 polypeptide fragments are also preferredr - 

Particularly, N-terminal deletions of the METH1 polypeptide can be 
described by the general formula m-950, where m is an integer from 2 to 949, 
where m corresponds to the position of the amino acid residue identified in SEQ 
ED NO:2. Preferably, N-terminal deletions of the METH1 polypeptide of the 
invention shown as SEQ ID NO:2 include polypeptides comprising the amino acid 
sequence of residues: G-2 to S-950; N-3 to S-950; A-4 to S-950; E-5 to S-950; 
R-6 to S-950; A-7 to S-950; P-8 to S-950; G-9 to S-950; S-10 to S-950; R-l 1 to 

S-950; S-12 to S-950; F-13 to S-950; G-14 to S-950; P-15 to S-950; V-16 to S- 

v 

950; P-l 7 to S-950; T-l 8 to S-950; L-l 9 to S-950; L-20 to S-950; L-21 to S-950; 
L-22 to S-950; A-23 to S-950; A-24 to S-950; A-25 to S-950; L-26 to S-950; L- 
27 to S-950; A-28 to S-950; V-29 to S-950; S-30 to S-950; D-3 1 to S-950; A-32 
to S-950; L-33 to S-950; G-34 to S-950; R-35 to S-950; P-36 to S-950; S-37 to 
S-950; E-38 to S-950; E-39 to S-950; D-40 to S-950; E-41 to S-950; E-42 to S- 
950; L-43 to S-950; V-44 to S-950; V-45 to S-950; P-46 to S-950; E-47 to S- 
950; L-48 to S-950; E-49 to S-950; R-50 to S-950; A-51 to S-950; P-52 to S- 
950; G-53 to S-950; H-54 to S-950; G-55 to S-950; T-56 to S-950; T-57 to S- 
950; R-58 to S-950; L-59 to S-950; R-60 to S-950; L-61 to S-950; H-62 to S- 
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950; A-63 to S-950; F-64 to S-950; D-65 to S-950; Q-66 to S-950; Q-67 to S- 
950; L-68 to S-950; D-69 to S-950; L-70 to S-950; E-71 to S-950; L-72 to S- 
950; R-73 to S-950; P-74 to S-950; D-75 to S-950; S-76 to S-950; S-77 to S-950; 
F-78 to S-950; L-79 to S-950; A-80 to S-950; P-81 to S-950; G-82 to S-950; F- 
5 83 to S-950; T-84 to S-950; L-85 to S-950; Q-86 to S-950; N-87 to S-950; V-88 

to S-950; G-89 to S-950; R-90 to S-950; K-91 to S-950; S-92 to S-950; G-93 to 
S-950; S-94 to S-950; E-95 to S-950; T-96 to S-950; P-97 to S-950; L-98 to S- 
950; P-99 to S-950; E-100 to S-950; T-101 to S-950; D-102 to S-950; L-103 to 
S-950; A-l 04 to S-950; H-l 05 to S-950; C- 1 06 to S-950; F-l 07 to S-950; Y- 1 08 

10 to S-950; S-109 to S-950; G-l 10 to S-950; T-l 1 1 to S-950; V-l 12 to S-950; N- 

1 13 to S-950; G-l 14 to S-950; D-l 15 to S-950; P-l 16 to S-950; S-l 17 to S-950; 
S-l 18 to S-950; A-l 19 to S-950; A-l 20 to S-950; A- 121 to S-950; L-122 to S- 
950; S-123 to S-950; L-124 to S-950; C-125 to S-950; E-126 to S-950; G-127 
to S-950; V-128 to S-950; R-129 to S-950; G-130 to S-950; A-13 1 to S-950; F- 

15 132 to S-950; Y-133 to S-950; L-134 to S-950; L-135 to S-950; G-136 to S-950; 

E-137 to S-950; A-138 to S-950; Y-139 to S-950; F-140 to S-950; 1-141 to S- 
950; Q-142 to S-950; P-143 to S-950; L-144 to S-950; P-145 to S-950; A-146 
to S-950; A- 147 to S-950; S-l 48 to S-950; E-149 to S-950; R-150 to S-950; L- 
151 to S-950; A-l 52 to S-950; T-l 53 to S-950; A- 154 to S-950; A- 155 to S-950; 

20 P-156 to S-950; G-157 to S-950; E-158 to S-950; K-159 to S-950; P-160 to S- 

950; P-161 to S-950; A-162 to S-950; P-163 to S-950; L-164 to S-950; Q-165 
to S-950; F-l 66 to S-950; H-l 67 to S-950; L-168 to S-950; L-169 to S-950; R- 
170 to S-950; R-171 to S-950; N-172 to S-950; R-173 to S-950; Q-174 to S-950; 
G-175 to S-950; D-176 to S-950; V-177 to S-950; G-178to S-950; G-l 79 to S- 

25 950; T-180 to S-950; C-181 to S-950; G-182 to S-950; V-183 to S-950; V-184 

to S-950; D-l 85 to S-950; D-l 86 to S-950; E-187 to S-950; P-l 88 to S-950; R- 
189 to S-950; P-190 to S-950; T-191 to S-950; G-192 to S-950; K-193 to S-950; 
A-194 to S-950; E-195 to S-950; T-196 to S-950; E-197 to S-950; D-198 to S- 
950; E-199 to S-950; D-200 to S-950; E-201 to S-950; G-202 to S-950; T-203 

30 to S-950; E-204 to S-950; G-205 to S-950; E-206 to S-950; D-207 to S-950; E- 
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208 to S-950; G-209 to S-950; P-210 to S-950; Q-21 1 to S-950; W-212 to S-950; 
S-213 to S-950; P-214 to S-950; Q-215 to S-950; D-216 to S-950; P-217 to S- 
950; A-218 to S-950; L-219 to S-950; Q-220to S-950; G-221 to S-950; V-222 
to S-950; G-223 to S-950; Q-224 to S-950; P-225 to S-950; T-226 to S-950; G- 
227 to S-950; T-228 to S-950; G-229 to S-950; S-230 to S-950; 1-23 1 to S-950; 
R-232 to S-950; K-233 to S-950; K-234 to S-950; R-235 to S-950; F-236 to S- 
950; V-237 to S-950; S-238 to S-950; S-239 to S-950; H-240 to S-950; R-241 
to S-950; Y-242 to S-950; V-243 to S-950; E-244 to S-950; T-245 to S-950; M- 
246 to S-950; L-247 to S-950; V-248 to S-950; A-249 to S-950; D-250to S-950; 
Q-251 to S-950; S-252 to S-950; M-253 to S-950; A-254 to S-950; E-255 to S- 
950; F-256 to S-950; H-257 to S-950; G-258 to S-950; S-259 to S-950; G-260 
to S-950; L-261 to S-950; K-262 to S-950; H-263 to S-950; Y-264 to S-950; L- 
265 to S-950; L-266 to S-950;T-267 to S-950; L-268 to S-950; F-269 to -S-950;- 
S-270 to S-950; V-271 to S-950; A-272 to S-950; A-273 to S-950; R-274 to S- 
950; L-275 to S-950; Y-276 to S-950; K-277 to S-950; H-278 to S-950; P-279 
to S-950; S-280 to S-950; 1-281 to S-950; R-282 to S-950; N-283 to S-950; S- 
284 to S-950; V-285 to S-950; S-286 to S-950; L-287 to S-950; V-288 to S-950; 
V-289 to S-950; V-290 to S-950; K-291 to S-950; 1-292 to S-950; L-293 to S- 
950; V-294 to S-950; 1-295 to S-950; H-296 to S-950; D-297 to S-950; E-298 
to S-950; Q-299 to S-950; K-300 to S-950; G-301 to S-950; P-302 to S-950; E- 
303 to S-950; V-304 to S-950; T-305 to S-950; S-306 to S-950; N-307 to S-950; 
A-308 to S-950; A-309 to S-950; L-310 to S-950; T-31 1 to S-950; L-312 to S- 
950; R-313 to S-950; N-314 to S-950; F-315 to S-950; C-316 to S-950; N-317 
to S-950; W-3 18 to S-950; Q-3 19 to S-950; K-320 to S-950; Q-321 to S-950; H- 
322 to S-950; N-323 to S-950; P-324 to S-950; P-325 to S-950; S-326 to S-950; 
D-327 to S-950; R-328 to S-950; D-329 to S-950; A-330 to S-950; E-331 to S- 
950; H-332 to S-950; Y-333 to S-950; D-334 to S-950; T-335 to S-950; A-336 
to S-950; 1-337 to S-950; L-338 to S-950; F-339 to S-950; T-340 to S-950; R- 
341 to S-950; Q-342 to S-950; D-343 to S-950; L-344 to S-950; C-345 to S-950; 
G-346 to S-950; S-347 to S-950; Q-348 to S-950; T-349 to S-950; C-350 to S- 
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950 D-351 to S-950; T-352 to S-950; L-353 to S-950; G-354 to S-950; M-355 
to S-950; A-356 to S-950; D-357 to S-950; V-358 to S-950; G-359 to S-950; T- 
360 to S-950; V-361 to S-950; C-362 to S-950; D-363 to S-950; P-364 to S-950; 
S-365 to S-950; R-366 to S-950; S-367 to S-950; C-368 to S-950; S-369 to S- 
950 V-370 to S-950; 1-371 to S-950; E-372 to S-950; D-373 to S-950; D-374 
to S-950; G-375 to S-950; L-376 to S-950; Q-377 to S-950; A-378 to S-950; A- 
379 to S-950; F-380 to S-950; T-38 1 to S-950; T-382 to S-950; A-383 to S-950; 
H-384 to S-950; E-385 to S-950; L-386 to S-950; G-387 to S-950; H-388 to S- 
950; V-389 to S-950; F-390 to S-950; N-391 to S-950; M-392 to S-950; P-393 
to S-950; H-394 to S-950; D-395 to S-950; D-396 to S-950; A^97 to S-950; K- 
398 to S-950; Q-399 to S-950; C-400 to S-950; A-401 to S-950; S-402 to S-950; 
L-403 to.S=95X); N.-404tp S-950; G-405 to S-950; V-406 to S-950; N-407 to S- 
950; Q-408 to S-950; D-409 to S-950; S-410 to S-950; H-41 1 to S-950; M-412 
to S-950; M-413 to S-950; A-414 to S-950; S-415 to S-950; M-416 to S-950; L- 
417 to S-950; S-41 8 to S-950; N-419 to S-950; L-420 to S-950; D-42 1 to S-950; 
H-422 to S-950; S-423 to S-950; Q-424 to S-950; P-425 to S-950; W-426 to S- 
950; S-427 to S-950; P-428 to S-950; C-429 to S-950; S-430 to S-950; A-43 1 to 
S-950; Y-432 to S-950; M-433 to S-950; 1-434 to S-950; T-435 to S-950; S-436 
to S-950; F-437 to S-950; L-438 to S-950; D-439 to S-950; N-440 to S-950; G- 
441 to S-950; H-442 to S-950; G-443 to S-950; E-444 to S-950; C-445 to S-950; 
L-446 to S-950; M-447 to S-950; D-448 to S-950; K-449 to S-950; P-450 to S- 
950; Q-45 1 to S-950; N-452 to S-950; P-453 to S-950; 1-454 to S-950; Q-455 to 
S-950; L-456 to S-950; P-457 to S-950; G-458 to S-950; D-459 to S-950; L-460 
to S-950; P-461 to S-950; G-462 to S-950; T-463 to S-950; S-464 to S-950; Y- 
465 to S-950; D-466 to S-950; A-467 to S-950; N-468 to S-950; R-469 to S-950; 
Q-470 to S-950; C-471 to S-950; Q-472 to S-950; F-473 to S-950; T-474 to S- 
950; F-475 to S-950; G-476 to S-950; E-477 to S-950; D-478 to S-950; S-479 
to S-950; K-480 to S-950; H-481 to S-950; C-482 to S-950; P-483 to S-950; D- 
484 to S-950; A-485 to S-950; A-486 to S-950; S-487 to S-950; T-488 to S-950; 
C-489 to S-950; S-490 to S-950; T-491 to S-950; L-492 to S-950; W-493 to S- 
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950; C-494 to S-950; T-495 to S-950; G-496 to S-950; T-497 to S-950; S-498 
to S-950; G-499 to S-950; G-500 to S-950; V-501 to S-950; L-502 to S-950; V- 
503 to S-950; C-504 to S-950; Q-505 to S-950; T-506 to S-950; K-507 to S-950; 
H-508 to S-950; F-509 to S-950; P-5 1 0 to S-950; W-5 1 1 to S-950; A-5 1 2 to S- 
950; D-513 to S-950; G-514 to S-950; T-515 to S-950; S-516 to S-950; C-517 
to S-950; G-5 1 8 to S-950; E-51 9 to S-950; G-520 to S-950; K-521 to S-950; W- 
522 to S-950; C-523 to S-950; 1-524 to S-950; N-525 to S-950; G-526 to S-950; 
K-527 to S-950; C-528 to S-950; V-529 to S-950; N-530 to S-950; K-53 1 to S- 
950; T-532 to S-950; D-533 to S-950; R-534 to S-950; K-535 to S-950; H-536 
to S-950; F-537 to S-950; D-538 to S-950; T-539 to S-950; P-540 to S-950; F- 
541 to S-950; H-542 to S-950; G-543 to S-950; S-544 to S-950; W-545 to S-950; 
G-546 to S-950; M-547 to S-950;.W-548 to S-950; G-549 to S-950; P-550 to S- 
950;'w-55 1 to S-950; G-552 to S-950; D-553 to S-950; C-554 to S-950; S-555 
to S-950; R-556 to S-950; T-557 to S-950; C-558 to S-950; G-559 to S-950; G- 
560 to S-950; G-561 to S-950; V-562 to S-950; Q-563 to S-950; Y-564 to S-950; 
T-565 to S-950; M-566 to S-950; R-567 to S-950; E-568 to S-950; C-569 to S- 
950; D-570 to S-950; N-571 to S-950; P-572 to S-950; V-573 to S-950; P-574 
to S-950; K-575 to S-950; N-576 to S-950; G-577 to S-950; G-578 to S-950; K- 
579 to S-950; Y-580 to S-950;'C-581 to S-950; E-582 to S-950; G-583 to S-950; 
K-584 to S-950; R-585 to S-950; V-586 to S-950; R-587 to S-950; Y-588 to S- 
950; R-589 to S-950; S-590 to S-950; C-591 to S-950; N-592 to S-950; L-593 
to S-950; E-594 to S-950; D-595 to S-950; C-596 to S-950; P-597 to S-950; D- 
598 to S-950; N-599 to S-950; N-600 to S-950; G-601 to S-950; K-602 to S-950; 
T-603 to S-950; F-604 to S-950; R-605 to S-950; E-606 to S-950; E-607 to S- 
950; Q-608 to S-950; C-609 to S-950; E-610 to S-950; A-61 1 to S-950; H-612 
to S-950; N-613 to S-950; E-614 to S-950; F-615 to S-950; S-616 to S-950; K- 
617 to S-950; A-61 8 to S-950; S-619 to S-950; F-620 to S-950; G-621 to S-950; 
S-622 to S-950; G-623 to S-950; P-624 to S-950; A-625 to S-950; V-626 to S- 
950; E-627 to S-950; W-628 to S-950; 1-629 to S-950; P-630 to S-950; K-631 
to S-950; Y-632 to S-950; A-633 to S-950; G-634 to S-950; V-635 to S-950; S- 
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636 to S-950; P-637 to S-950; K-638 to S-950; D-639 to S-950; R-640 to S-950; 
C-641 to S-950; K-642 to S-950; L-643 to S-950; 1-644 to S-950; C-645 to S- 
950; Q-646 to S-950; A-647 to S-950; K-648 to S-950; G-649 to S-950; 1-650 
to S-950; G-651 to S-950; Y-652 to S-950; F-653 to S-950; F-654 to S-950; V- 
655 to S-950; L-656 to S-950; Q-657 to S-950; P-658 to S-950; K-659 to S-950; 
V-660 to S-950; V-661 to S-950; D-662 to S-950; G-663 to S-950; T-664 to S- 
950; P-665 to S-950; C-666 to S-950; S-667 to S-950; P-668 to S-950; D-669 to 
S-950; S-670 to S-950; T-671 to S-950; S-672 to S-950; V-673 to S-950; C-674 
to S-950; V-675 to S-950; Q-676 to S-950; G-677 to S-950; Q-678 to S-950; C- 
679 to S-950; V-680 to S-950; K-68 1 to S-950; A-682 to S-950; G-683 to S-950; 
C-684 to S-950; D-685 to S-950; R-686 to S-950; 1-687 to S-950; 1-688 to S- 
950; D-689 to S-950; S-690 to 8-950^-691 to S-950, K-692 to S-950; K-693 
to S-950; F-694 to S-950; D-695 to S-950; K-696 to S-950; C-697 to S-950; G- 
698 to S-950; V-699 to S-950; C-700 to S-950; G-701 to S-950; G-702 to S-950; 
N-703 to S-950; G-704 to S-950; S-705 to S-950; T-706 to S-950; C-707 to S- 
950; K-708 to S-950; K-709 to S-950; 1-710 to S-950; S-71 1 to S-950; G-712 to 
S-950; S-7 13 to S-950; V-714 to S-950; T-71 5 to S-950; S-71 6 to S-950; A-7 1 7 
to S-950; K-718 to S-950; P-719 to S-950; G-720 to S-950; Y-721 to S-950; H- 
722 to S-950; D-723 to S-950; 1-724 to S-950; 1-725 to S-950; T-726 to S-950; 
1-727 to S-950; P-728 to S-950; T-729 to S-950; G-730 to S-950; A-731 to S- 
950; T-732 to S-950; N-733 to S-950; 1-734 to S-950; E-735 to S-950; V-736 to 
S-950; K-737 to S-950; Q-738 to S-950; R-739 to S-950; N-740 to S-950; Q-741 
to S-950; R-742 to S-950; G-743 to S-950; S-744 to S-950; R-745 to S-950; N- 
746 to S-950; N-747 to S-950; G-748 to S-950; S-749 to S-950; F-750 to S-950; 
L-751 to S-950; A-752 to S-950; 1-753 to S-950; K-754 to S-950; A-755 to S- 
950; A-756 to S-950; D-757 to S-950; G-758 to S-950; T-759 to S-950; Y-760 
to S-950; 1-761 to S-950; L-762 to S-950; N-763 to S-950; G-764 to S-950; D- 
765 to S-950; Y-766 to S-950; T-767 to S-950; L-768 to S-950; S-769 to S-950; 
T-770 to S-950; L-771 to S-950; E-772 to S-950; Q-773 to S-950; D-774 to S- 
950; 1-775 to S-950; M-776 to S-950; Y-777 to S-950; K-778 to S-950; G-779 
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to S-950; V-780 to S-950; V-781 to S-950; L-782 to S-950; R-783 to S-950; Y- 
784 to S-950; S-785 to S-950; G-786 to S-950; S-787 to S-950; S-788 to S-950; 
A-789 to S-950; A-790 to S-950; L-791 to S-950; E-792 to S-950; R-793 to S- 
950; 1-794 to S-950; R-795 to S-950; S-796 to S-950; F-797 to S-950; S-798 to 
S-950; P-799 to S-950; L-800 to S-950; K-801 to S-950; E-802 to S-950; P-803 
to S-950; L-804 to S-950; T-805 to S-950; 1-806 to S-950; Q-807 to S-950; V- 
808 to S-950; L-809 to S-950; T-810 to S-950; V-81 1 to S-950; G-812 to S-950; 
N-813 to S-950; A-814 to S-950; L-815 to S-950; R-816 to S-950; P-817 to S- 
950; K-8 1 8 to S-950; 1-8 1 9 to S-950; K-820 to S-950; Y-821 to S-950; T-822 to 
S-950; Y-823 to S-950; F-824 to S-950; V-825 to S-950; K-826 to S-950; K-827 
to S-950; K-828 to S-950; K-829 to S-950; E-830 to S-950; S-83 1 to S-950; F- 
832 to-S-950; N-833 to S-950; A-834AO S-950; 1-835 to S-950; P-836 to S-950; 
T-837 'to S-950; F-838 to S-9507S-~839to S-950; A-840 to S-950; W-841 to S^ 
950- V-842 to S-950; 1-843 to S-950; E-844 to S-950; E-845 to S-950; W-846 
to S-950; G-847 to S-950; E-848 to S-950; C-849 to S-950; S-850 to S-950; K- 
851 to S-950; S-852 to S-950; C-853 to S-950; E-854 to S-950; L-855 to S-950; 
G-856 to S-950; W-857 to S-950; Q-858 to S-950; R-859 to S-950; R-860 to S- 
950; L-861 to S-950; V-862 to S-950; E-863 to S-950; C-864 to S-950; R-865 
to S-950; D-866 to S-950; 1-867 to S-950; N-868 to S-950; G-869 to S-950; Q- 
870 to S-950; P-871 to S-950; A-872 to S-950; S-873 to S-950; E-874 to S-950; 
C-875 to S-950; A-876 to S-950; K-877 to S-950; E-878 to S-950; V-879 to S- 
950; K-880 to S-950; P-881 to S-950; A-882 to S-950; S-883 to S-950; T-884 
to S-950; R-885 to S-950; P-886 to S-950; C-887 to S-950; A-888 to S-950; D- 
889 to S-950; H-890 to S-950; P-891 to S-950; C-892 to S-950; P-893 to S-950; 
Q-894 to S-950; W-895 to S-950; Q-896 to S-950; L-897 to S-950; G-898 to S- 
950; E-899 to S-950; W-900 to S-950; S-901 to S-950; S-902 to S-950; C-903 
to S-950; S-904 to S-950; K-905 to S-950; T-906 to S-950; C-907 to S-950; G- 
908 to S-950; K-909 to S-950; G-9 1 0 to S-950; Y-9 1 1 to S-950; K-9 1 2 to S-950; 
K-913 to S-950; R-914 to S-950; S-915 to S-950; L-916 to S-950; K-917 to S- 
950- C-918 to S-950; L-919 to S-950; S-920 to S-950; H-921 to S-950; D-922 
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to S-950; G-923 to S-950; G-924 to S-950; V-925 to S-9S0; L-926 to S-950; S- 
927 to S-950; H-928 to S-950; E-929 to S-950; S-930 to S-950; C-93 1 to S-950; 
D-932 to S-950; P-933 to S-950; L-934 to S-950; K-935 to S-950; K-936 to S- 
950; P-937 to S-950; K-938 to S-950; H-939 to S-950; F-940 to S-950; 1-941 to 
5 S-950; D-942 to S-950; F-943 to S-950; C-944 to S-950; T-945 to S-950; of SEQ 

ID NO:2. 

Moreover, C-terminal deletions of the METH1 polypeptide can also be 
described by the general formula 1-n, where n is an integer from 2 to 950, where 
n corresponds to the position of amino acid residue identified in SEQ ID NO:2. 
10 Preferably, C-terminal deletions of the METH1 polypeptide of the invention 

shown as SEQ ID NO:2 include polypeptides comprising the amino acid sequence 
of residues. M,LtoX-949; M-l to E-948; M-l to A-947; M-l to M-946; M-l to 
T-945; M-l to C-944; M-l to F-943; M-l to D-942; M-l to 1-941; M-l to F-940; 
M-l to H-939; M-l to K-938; M-l to P-937; M-l to K-936; M-l to K-935; M-l 
15 toL-934;M-l toP-933;M-l toD-932;M-l toC-931;M-l toS-930;M-l toE- 

929; M-l to H-928; M-l to S-927; M-l to L-926; M-l to V-925; M-l to G-924; 
M-l to G-923; M-l toD-922; M-l to H-921; M-l to S-920; M-l to L-919; M-l 
toC-918;M-l toK-917;M-l toL-916;M-l toS-915;M-l toR-914;M-l toK- 
913;M-ltoK-912;M-l to Y-911; M-l to G-910;M-1 toK-909;M-l toG-908; 
20 M-l to C-907; M-l to T-906; M-l to K-905; M-l to S-904; M-l to C-903; M-l 

toS-902;M-l toS-901;M-l toW-900;M-l toE-899;M-l to G-898; M-l toL- 
897; M-l to Q-896; M-l to W-895; M-l to Q-894; M-l to P-893; M-l to C-892; 
M-l toP-891;M-ltoH-890;M-l toD-889;M-l to A-888;M-1 toC-887;M-l 
to P-886; M-l to R-885; M-l to T-884; M-l to S-883; M-l to A-882; M-l to P- 
25 881; M-l to K-880; M-l to V-879; M-l to E-878; M-l to K-877; M-l to A-876; 

M-l to C-875; M-l to E-874; M-l to S-873; M-l to A-872; M-l to P-871; M-l 
to Q-870; M-l to G-869; M-l to N-868; M-l to 1-867; M-l to D-866; M-l to R- 
865; M-l to C-864; M-l to E-863; M-l to V-862; M-l to L-861; M-l to R-860; 
M-l toR-859;M-l toQ-858;M-l toW-857;M-l toG-856;M-l toL-855;M-l 
30 to E-854; M-l to C-853; M-l to S-852; M-l to K-85 1 ; M-l to S-850; M-l to C- 
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849; M-1 to E-848; M-1 to G-847; M-1 to W-846; M-1 to E-845; M-1 to E-844; 
M-1 to 1-843; M-1 to V-842; M-1 to W-841; M-1 to A-840; M-1 to S-839; M-1 
toF-838;M-l toT-837;M-l toP-836; M-1 to 1-835; M-1 to A-834; M-1 to N- 
833; M-1 to F-832; M-1 to S-831.M-1 toE-830;M-l toK-829;M-l toK-828; 
M-1 to K-827; M-1 to K-826; M-1 to V-825; M-1 to F-824; M-1 to Y-823; M-1 
to T-822; M-1 to Y-821; M-1 to K-820; M-1 to 1-819; M-1 to K-818; M-1 to P- 
817; M-1 to R-816; M-1 to L-81 5; M-1 to A-814; M-1 toN-813; M-1 to G-812; 
M-1 to V-81 1 ; M-1 to T-810; M-1 to L-809; M-1 to V-808; M-1 to Q-807; M-1 
to 1-806; M-1 to T-805; M-1 to L-804; M-1 to P-803; M-1 to E-802; M-1 to K- 
801; M-1 to L-800; M-1 to P-799; M-1 to S-798; M-1 to F-797; M-1 to S-796; 
M-1 to R-795; M-1 to 1-794; M-1 to R-793; M-1 to E-792; M-1 to L-791; M-1 
-to A-790; M-1 to A-789; M-1 to S-788; M-1. to S-787; M-1 to G-786; M-1 to S- 
" "785; M-1 to Y-784; M-1 to R-783; M-1 to L-782; M-1 to V-781; M-1 to V-780; 
M-1 to G-779; M-1 to K-778; M-1 to Y-777; M-1 to M-776; M-1 to 1-775; M-1 
15 to D-774; M-1 to Q-773; M-1 to E-772; M-1 toL-771; M-1 to T-770; M-1 to S- 

769; M-1 to L-768; M-1 to T-767; M-1 to Y-766; M-1 to D-765; M-1 to G-764; 
M-1 toN-763; M-1 to L-762; M-1 to 1-761; M-1 to Y-760; M-1 to T-759; M-1 
to G-758; M-1 to D-757; M-1 to A-756; M-1 to A-755; M-1 to K-754; M-1 to 
1-753; M-1 to A-752; M-1 to L-75 1 ; M-1 to F-750; M-1 to S-749; M-1 to G-748; 
20 M-1 to N-747; M-1 to N-746; M-1 to R-745; M-1 to S-744; M-1 to G-743; M-1 

to R-742; M-1 to Q-741; M-1 to N-740; M-1 to R-739; M-1 to Q-738; M-1 to 
K-737; M-1 to V-736; M-1 to E-735; M-1 to 1-734; M-1 to N-733; M-1 to T- 
732; M-1 to A-731; M-1 to G-730; M-1 to T-729; M-1 to P-728; M-1 to 1-727; 
M-1 to T-726; M-1 to 1-725; M-1 to 1-724; M-1 to D-723; M-1 to H-722; M-1 
25 to Y-721; M-1 to G-720; M-1 to P-719; M-1 to K-718; M-1 to A-717; M-1 to S- 

716; M-1 toT-715;M-l to V-714; M-1 to S-713;M-1 toG-712;M-l toS-711; 
M-1 to 1-710; M-1 to K-709; M-1 to K-708; M-1 to C-707; M-1 to T-706; M-1 
to S-705; M-1 to G-704; M-1 to N-703; M-1 to G-702; M-1 to G-701 ; M-1 to C- 
700; M-1 to V-699; M-1 to G-698; M-1 to C-697; M-1 to K-696; M-1 to D-695; 
30 M-1 to F-694; M-1 to K-693; M-1 to K-692; M-1 to K-691; M-1 to S-690; M-1 
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to D-689; M-l to 1-688; M-l to 1-687; M-l to R-686; M-l to D-685; M-l to C- 
684; M-l to G-683.M-1 to A-682; M-l toK-681;M-l to V-680; M-l to C-679; 
M-l to Q-678; M-l to G-677; M-l to Q-676; M-l to V-675; M-l to C-674; M-l 
to V-673; M-l to S-672; M-l to T-671; M-l to S-670; M-l to D-669; M-l to P- 
668; M-l to S-667; M-l to C-666; M-l to P-665; M-l to T-664; M-l to G-663; 
M-l to D-662; M-I to V-661; M-l to V-660; M-l to K-659; M-l to P-658; M-l 
toQ-657;M-l toL-656;M-l toV-655;M-l toF-654;M-l toF-653;M-l to Y- 
652; M-l to G-651; M-l to 1-650; M-l to G^649; M-l to K-648; M-l to A-647; 
M-l to Q-646; M-l to C-645; M-l to 1-644; M-l to L-643; M-l to K-642; M-l 
to C-641 ; M-l to R-640; M-l to D-639; M-l to K-638; M-l to P-637; M-l to S- 
636; M-l to V-635; M-l to G-634; M-l to A-633; M-l to Y-632; M-l to K-631; 
M-l to P-630; M-l to 1-629; M-l to W-628; M-l to E-627; M-l to V-626; M^l 
to A-625; M-l to P-624; M-l to G-623; M-l to S-622; M-fto G-621 ; M-l to F- 
620; M-l to S-619; M-l to A-618; M-l to K-617; M-l to S-616; M-l to F-61 5; 
M-l toE-614;M-l toN-613;M-l to H-612;M-1 to A-611;M-1 toE-610;M-l 
to C-609; M-l to Q-608; M-l to E-607; M-l to E-606; M-l to R-605; M-l to F- 
604; M-l to T-603; M-l to K-602; M-l to G-601 ; M-l to N-600; M-l to N-599; 
M-l to D-598; M-l to P-597; M-l to C-596; M-l to D-595; M-l to E-594; M-l 
toL-593;M-l toN-592;M-l toC-591;M-l to S-590; M-ltoR-589; M-l toY- 
588; M-l toR-587;M-l to V-586;M-1 toR-585;M-l toK-584;M-l to G-583; 
M-l to E-582; M-l to C-581; M-l to Y-580; M-l to K-579; M-l to G-578; M-l 
to G-577; M-l to N-576; M-l to K-575; M-l to P-574; M-l to V-573; M-l to P- 
572; M-l to N-571; M-l to D-570; M-l to C-569; M-l to E-568; M-l to R-567; 
M-l to M-566; M-l toT-565; M-l to Y-564; M-l to Q-563; M-l to V-562; M-l 
to G-561 ; M-l to G-560; M-l to G-559; M-l to C-558; M-l to T-557; M-l to R- 
556; M-l to S-555;M-1 toC-554;M-l toD-553;M-l toG-552;M-l toW-551; 
M-l to P-550; M-l to G-549; M-l to W-548; M-l to M-547; M-l to G-546; M-l 
to W-545; M-l to S-544; M-l to G-543; M-l to H-542; M-l to F-541; M-l to 
P-540; M-l to T-539; M-l to D-538; M-l to F-537; M-l to H-536; M-l to K- 
535; M-l toR-534;M-l toD-533;M-l toT-532;M-l toK-531;M-l toN-530; 
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M-1 to V-529;M-1 to C-528;M-1 toK-527;M-l to G-526;M-1 toN-525; M-l 
to 1-524; M-l toC-523;M-l toW-522;M-l toK-521;M-l toG-520; M-l to E- 
519; M-l toG-518;M-l toC-517;M-l toS-516;M-l toT-515;M-l to G-514; 
M-l toD-513;M-l to A-512; M-l to W-511;M-1 toP-510;M-l toF-509;M-l 
toH-508; M-l toK-507;M-l toT-506;M-l toQ-505;M-l toC-504; M-l to V- 
503; M-l to L-502; M-l to V-501 ; M-l to G-500; M-l to G-499; M-l to S-498; 
M-l to T-497; M-l to G-496; M-l to T-495; M-l to C-494; M-l to W-493; M-l 
to L-492; M-l to T^91; M-l to S-490; M-l to C-489; M-l to T-488; M-l to S- 
487; M-l to A-486; M-l to A-485; M-l to D-484; M-l to P-483; M-l to C-482; 
M- 1 to H-48 1 ; M- 1 to K-480; M- 1 to S-479; M- 1 to D-478; M- 1 to E-477; M- 1 
toG-476; M-l toF-475; M-l to T-474;M-1 toF-473;M-l toQ-472; M-l toC- 
471; M-l to Q-470; M-l toR-469;M-l toN-468;M-l to A-467;M-1 to D-466; 
M-l to Y-465; M-l to S-464; M-l to-T-463; M-l to G-462; M-l to P-461; M-l 
toL-460; M-l toD-459;M-l toG-458;M-l toP-457; M-l toL-456; M-l to Q- 
455; M-l to 1-454; M-l to P-453; M-l to N-452; M-l to Q-451; M-l to P-450; 
M-l to K-449; M-l to D-448; M-l to M-447; M-l to L-446; M-l to C-445; M-l 
to E-444; M-l to G-443; M-l to H-442; M-l to G-441; M-l to N-440; M-l to 
D-439; M-l toL-438;M-l toF-437; M-l to S-436; M-l to T-435; M-l to 1-434; 
M-l to M-433; M-l to Y-432; M-l to A-431; M-l to S-430; M-l to CM29; M-l 
toP-428;M-l to S-427; M-l to W-426; M-l toP-425; M-l toQ-424; M-l to S- 
423; M-l toH-422; M-l toD-421;M-l toL-420;M-l toN-419;M-l toS-418; 
M- 1 to L-4 1 7 ; M- 1 to M-4 1 6; M- 1 to S-4 1 5 ; M- 1 to A-4 1 4; M- 1 to M-4 1 3 ; M- 1 
to M-412; M-l to H-411; M-l to S-410; M-l to D-409; M-l to Q-408; M-l to 
N-407; M-l to V-406; M-l to G-405; M-l to N-404; M-l to L-403; M-l to S- 
402; M-l to A-401; M-l to C-400; M-l to Q-399; M-l to K-398; M-l to A-397; 
M-3 to D-396; M-l to D-395; M-l to H-394; M-l to P-393; M-l to M-392; M-l 
toN-391;M-l toF-390;M-l to V-389; M-l toH-388;M-l toG-387; M-l toL- 
386; M-l toE-385;M-l toH-384; M-l to A-383;M-1 toT-382;M-l toT-381; 
M-l to F-380; M-l to A-379; M-l to A-378; M-l to Q-377; M-l to L-376; M-l 
toG-375;M-l toD-374; M-l to D-373; M-l toE-372; M-l to 1-371; M-l to V- 
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370; M-l to S-369; M-l to C-368; M-l to S-367; M-l to R-366; M-l to S-365; 
M-l toP-364; M-l to D-363; M-l to C-362; M-l to V-361; M-l toT-360; M-l 
to G-359; M-l to V-358; M-l to D-357; M-l to A-356; M-l to M-355; M-l to 
G-354; M-l to L-353; M-l to T-352; M-l to D-351; M-l to C-350; M-l to T- 
5 349; M-l to Q-348; M-l to S-347; M-l to G-346; M-l to C-345; M-l to L-344; 

M-l to D-343; M-l to Q-342; M-l to R-341; M-l to T-340; M-l to F-339; M-l 
to L-338; M-l to 1-337; M-l to A-336; M-l to T-335; M-l to D-334; M-l to Y- 
333;M-1 toH-332; M-l toE-331;M-l to A-330;M-1 toD-329; M-l to R-328; 
M-l to D-327; M-l to S-326; M-l to P-325; M-l to P-324; M-l to N-323; M-l 

10 toH-322;M-l toQ-321;M-l to K-320; M-l to Q-3 19; M-l toW-318;M-l to 

N-317;M-1 toC-316; M-l toF-315;M-l toN-314; M-l toR-313;M-l to L- 
312; M-l toT-311;M-l toL-310;M-l toA-309;M-l to A-308;M-1 toN-307; 

— - -M-l to S-306;-M-l to T-305; M-l to V-304; M-l to E-303; M-l to P-302; M-l 

to G-301; M-l to K-300; M-l to Q-299; M-l to E-298; M-l to D-297; M-l to 

15 H-296; M-l to 1-295; M-l to V-294; M-l to L-293; M-l to 1-292; M-l to K-291 ; 

M-l to V-290; M-l to V-289; M-l to V-288; M-l to L-287; M-l to S-286; M-l 
to V-285; M-l to S-284; M-l to N-283; M-l to R-282; M-l to 1-281; M-l to S- 
280; M-l to P-279; M-l to H-278; M-l to K-277; M-l to Y-276; M-l to L-275; 
M-l to R-274; M-l to A-273; M-l to A-272; M-l to V-271 ; M-l to S-270; M-l 

20 to F-269; M-l to L-268; M-l to T-267; M-l to L-266; M-l to L-265; M-l to Y- 

264; M-l toH-263; M-l toK-262;M-l toL-261;M-l toG-260;M-l to S-259; 
M-l to G-258; M-l to H-257; M-l to F-256; M-l to E-255; M-l to A-254; M-l 
to M-253; M-l to S-252; M-l to Q-251; M-l to D-250; M-l to A-249; M-l to 
V-248; M-l to L-247; M-l to M-246; M-l to T-245; M-l to E-244; M-l to V- 

25 243 ; M- 1 to Y-242; M- 1 to R-24 1 ; M- 1 to H-240; M- 1 to S-23 9; M- 1 to S-23 8 ; 

M-l to V-237; M-l to F-236; M-l to R-235; M-l to K-234; M-l to K-233; M-l 
to R-232; M-l to 1-23 1; M-l to S-230; M-l to G-229; M-l to T-228; M-l to G- 
227; M-l to T-226;'M-1 to P-225; M-l to Q-224; M-l to G-223; M-l to V-222; 
M-l toG-221;M-l to Q-220; M-l to L-219; M-l to A-218; M-l to P-217; M-l 

30 to D-216; M-l to Q-215; M-l to P-214; M-l to S-213; M-l to W-212; M-l to 
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5-211; M-l to P-210; M-l to G-209; M-l to E-208; M-l to D-207; M-l to E- 
106; M-l to G-205; M-l to E-204; M-l to T-203; M-l to G-202; M-l to E-201; 
vl-1 toD-200; M-l to E-199; M-l to D- 198; M-l to E- 197; M-l toT-196; M-l 
oE-195;M-l toA-194;M-l toK-193;M-l to G-192; M-l toT-191;M-l toP- 
! 90; M-l toR-189;M-l toP-188; M-l toE-187; M-l toD-186; M-l toD-185; 
vl-1 to V-184; M-l to V-183;M-1 to G-182; M-l to C-181; M-l toT-180; M-l 
o G-179; M-l to G-178; M-l to V-177; M-l to D-176; M-l to G-175; M-l to 
3-174; M-l toR-173;M-l toN-172; M-l to R-171; M-l toR-170;M-l to L- 
.69; M-l toL-168;M-l toH-I67;M-l toF-166;M-l toQ-165;M-l to L-164; 
vl-1 to P-163;M-1 to A-162; M-l to P-161; M-l to P-160; M-l toK-159; M-l 
oE-158;M-l toG-157;M-l to P-156;M-1 to A-155;M-1 to A-154;M-1 toT- 
53;M-1 to A-152;M-1 toL-151;M-l toR-150;M-l toE-149;M-l to S-148; 
vl-l_to A-147; M-l to A-146; M^l toJP r 145; M-l to L-144; M-lto P-1.43; M-l 
oQ-142;M-l to 1-141; M-l toF-140;M-l to Y-139; M-l to A-138; M-l toE- 
.37; M-l to G-136;M-1 toL-135;M-l toL-134;M-l to Y-133;M-1 toF-132; 
vl-1 to A-13 1; M-l to G-130; M-l to R-129; M-l to V-128; M-I to G-127; M-l 
o E-126; M-l to C-125; M-l to L-124; M-l to S-123; M-l to L-122; M-l to A- 
.21; M-l to A-120;M-1 to A-119;M-1 to S-118;M-1 toS-117;M-l toP-116; 
vl-1 to D-l 1 5; M-l to G-l 14; M-l to N-l 13; M-l to V-l 12; M-l to T-l 1 1; M-l 
o G-l 10; M-l to S-109; M-l to Y-108; M-l toF-107; M-l to C-106; M-l to H- 
.05; M-l to A-104; M-l to L-103; M-l toD-102; M-l to T-101; M-l to E-100; 
vl-1 to P-99; M-l to L-98; M-l to P-97; M-l to T-96; M-l to E-95; M-l to S-94; 
vl-1 to G-93; M-l to S-92; M-l to K-91; M-l to R-90; M-l to G-89; M-l to V- 
i8; M-l to N-87; M-l to Q-86; M-l to L-85; M-l to T-84; M-l to F-83; M-l to 
3-82; M-l to P-81; M-l to A-80; M-l to L-79; M-l to F-78; M-l to S-77; M-l 
oS-76;M-l toD-75;M-l toP-74; M-l toR-73;M-l toL-72;M-l toE-71; M- 
to L-70; M-l to D-69; M-l to L-68; M-l to Q-67; M-l to Q-66; M-l to D-65; 
vl-1 to F-64; M-l to A-63; M-l to H-62; M-l to L-61 ; M-l to R-60; M-l to L- 
i9; M-l to R-58; M-l to T-57; M-l to T-56; M-l to G-55; M-l to H-54; M-l to 
j-53; M-l to P-52; M-l to A-51; M-l to R-50; M-l to E-49; M-l to L-48; M-l 
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to E-47; M-l to P-46; M-l to V-45; M-l to V-44; M-l to L-43; M-l to E-42; M- 
1 to E-41; M-l to D-40; M-l to E-39; M-l to E-38; M-l to S-37; M-l to P-36; 
M-l toR-35;M-l to G-34; M-l to L-33;M-1 toA-32; M-l toD-31;M-l to S- 
30; M-l to V-29; M-l to A-28; M-l to L-27; M-l to L-26; M-l to A-25; M-l to 
A-24; M-l to A-23; M-l to L-22; M-l to L-21; M-l to L-20; M-l to L-19; M-l 
toT-18;M-l toP-17;M-l to V-16; M-l toP-15;M-l toG-14;M-I toF-13;M- 
1 to S-12; M-l to R-l 1; M-l to S-10; M-l to G-9; M-l to P-8; M-l to A-7; of 
SEQ ID NO:2. For example, any of the above listed N- or C-terminal deletions 
can be combined to produce a N- and C-terminal deleted METH1 polypeptide. 

Moreover, N-terminal deletions of the METH2 polypeptide can be 
described by the general formula m-890, where m is an integer from 2 to 889, 
where m corresponds to the position of the amino acid residue identified in SEQ 
ID NO:4. Preferably, Nrterminal deletions of the METH2 polypeptide of the 
invention shown as SEQ ID NO:4 include polypeptides comprising the amino acid 
sequence of residues: F-2 to L-890; P-3 to L-890; A-4 to L-890; P-5 to L-890; 
A-6 to L-890; A-7 to L-890; P-8 to L-890; R-9 to L-890; W-10 to L-890; L-l 1 
to L-890; P-12 to L-890; F-13 to L-890; L-l 4 to L-890; L-l 5 to L-890; L-l 6 to 
L-890; L-l 7 to L-890; L-l 8 to L-890; L-19 to L-890; L-20 to L-890; L-21 to L- 



30 



890; L-22 to L-890 
890; G-27 to L-890: 
890; P-3 2 to L-890: 
890; Q-3 7 to L-890 
890; V-42 to L-890 
890; L-47 to L-890 
890; G-52 to L-890 
890; H-57 to L-890 
890; G-62 to L-890 
890; L-67 to L-890 
890; D-72 to L-890 
890; A-77 to L-890 



P-23 to L-890; L-24 to L-890; A-25 to L-890; R-26 to L- 
A-28 to L-890; P-29 to L-890; A-30 to L-890; R-3 1 to L- 
A-33 to L-890; A-34 to L-890; G-35 to L-890; G-36 to L- 
A-38 to L-890; S-39 to L-890; E-40 to L-890; L-41 to L- 
V-43 to L-890; P-44 to L-890; T-45 to L-890; R-46 to L- 
P-48 to L-890; G-49 to L-890; S-50 to L-890; A-51 to L- 
E-53 to L-890; L-54 to L-890; A-55 to L-890; L-56 to L- 
L-58 to L-890; S-59 to L-890; A-60 to L-890; F-61 to L- 
K-63 to L-890; G-64 to L-890; F-65 to L-890; V-66 to L- 
R-68 to L-890; L-69 to L-890; A-70 to L-890; P-71 to L- 
D-73 to L-890; S-74 to L-890; F-75 to L-890; L-76 to L- 
P-78 to L-890; E-79 to L-890; F-80 to L-890; K-81 to L- 
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890; 1-82 to L-890; E-83 to L-890; R-84 to L-890; L-85 to L-890; G-86 to L- 
890; G-87 to L-890; S-88 to L-890; G-89 to L-890; R-90 to L-890; A-9] to L- 
890; T-92 to L-890; G-93 to L-890; G-94 to L-890; E-95 to L-890, R-96 to L- 
890; G-97 to L-890; L-98 to L-890; R-99 to L-890; G- 1 00 to L-890; C- 1 0 1 to L- 
890; F-102 to L-890; F-103 to L-890; S-104 to L-890; G-105 to L-890; T-106 
to L-890; V-) 07 to L-890; N-108 to L-890; G-109 to L-890; E-l 1 0 to L-890; P- 
1 ] I to L-890; E- 1 1 2 to L-890; S- 1 1 3 to L-890; L- 1 ] 4 to L-890; A- 1 1 5 to L-890; 
A-l 16 to L-890; V-l 17 to L-890; S-l 18 to L-890; L-l 19 to L-890; C-120 to L- 
890; R-121 to L-890; G-122 to L-890; L-123 to L-890; S-124 to L-890; G-125 
to L-890; S-l 26 to L-890; F-127 to L-890; L-l 28 to L-890; L-l 29 to L-890; D- 
1 30 to L-890; G-l 3 1 to L-890; E-l 32 to L-890; E-l 33 to L-890; F-l 34 to L-890; 
T-135 to L-890; 1-136 to L-890; Q-137 to L-890; P-138 to L-890; Q-139 to L- 
890; G-140 to L-890; A-141 to L-890; G-142 to L-890; G- I43 to L-890; S-144 
to L-890; L-l 45 to L-890; A- 146 to L-890; Q-147 to L-890; P-148 to L-890; H- 
149 to L-890; R-l 50 to L-890; L-l 5 1 to L-890; Q-152 to L-890; R-l 53 to L-890; 
W-154 to L-890; G-155 to L-890; P-156 to L-890; A-157 to L-890; G-158 to L- 
890; A-l 59 to L-890; R-160 to L-890; P-161 to L-890; L-162 to L-890; P-163 
to L-890; R-164 to L-890; G-165 to L-890; P-l 66 to L-890; E-167 to L-890; W- 
1 68 to L-890; E-l 69 to L-890; V-l 70 to L-890; E-l 71 to L-890; T-172 to L-890; 
G-l 73 to L-890; E-l 74 to L-890; G-l 75 to L-890; Q-176 to L-890; R-l 77 to L- 
890; Q-178 to L-890; E-179 to L-890; R-l 80 to L-890; G-l 81 to L-890; D-l 82 
to L-890; H-l 83 to L-890; Q-184 to L-890; E-l 85 to L-890; D-l 86 to L-890; S- 
1 87 to L-890; E- 1 88 to L-890; E- 1 89 to L-890; E-l 90 to L-890; S- 1 9 1 to L-890; 
Q-192 to L-890; E-l 93 to L-890; E-l 94 to L-890; E-l 95 to L-890; A- 196 to L- 
890; E-197 to L-890; G-198 to L-890; A-199 to L-890; S-200 to L-890; E-201 
to L-890; P-202 to L-890; P-203 to L-890; P-204 to L-890; P-205 to L-890; L- 
206 to L-890; G-207 to L-890; A-208 to L-890; T-209 to L-890; S-2 1 0 to L-890; 
R-21 1 to L-890; T-212 to L-890; K-213 to L-890; R-214 to L-890; F-215 to L- 
890; V-216 to L-890; S-2 17 to L-890; E-218 to L-890; A-219 to L-890; R-220 
to L-890; F-221 to L-890; V-222 to L-890; E-223 to L-890; T-224 to L-890; L- 
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225 to L-890; L-226 to L-890; V-227 to L-890; A-228 to L-890; D-229 to L-890; 
A-230 to L-890; S-23 1 to L-890; M-232 to L-890; A-233 to L-890; A-234 to L- 
890; F-235 to L-890; Y-236 to L-890; G-237 to L-890; A-238 to L-890; D-239 
to L-890; L-240 to L-890; Q-241 to L-890; N-242 to L-890; H-243 to L-890; I- 
5 244 to L-890; L-245 to L-890; T-246 to L-890; L-247 to L-890; M-248 to L-890; 

S-249 to L-890; V-250 to L-890; A-251 to L-890; A-252 to L-890; R-253 to L- 
890; 1-254 to L-890; Y-255 to L-890; K-256 to L-890; H-257 to L-890; P-258 
to L-890; S-259 to L-890; 1-260 to L-890; K-261 to L-890; N-262 to L-890; S- 
263 to L-890; 1-264 to L-890; N-265 to L-890; L-266 to L-890; M-267 to L-890; 

10 V-268 to L-890; V-269 to L-890; K-270 to L-890; V-271 to L-890; L-272 to L- 

890; 1-273 to L-890; V-274 to L-890; E-275 to L-890; D-276 to L-890; E-277 
to L-890; K-278 to L-890; W-279 to L-890; G-280 to L-890; P-281 to L-890; E- 
282 to L-890; V-283 to L-890; S-284 to L-890; D-285 to L-890; N-286 to L-890; 
G-287 to L-890; G-288 to L-890; L-289 to L-890; T-290 to L-890; L-291 to L- 

15 890; R-292 to L-890; N-293 to L-890; F-294 to L-890; C-295 to L-890; N-296 

to L-890; W-297 to L-890; Q-298 to L-890; R-299 to L-890; R-300 to L-890; F- 
301 to L-890; N-302 to L-890; Q-303 to L-890; P-304 to L-890; S-305 to L-890; 
D-306 to L-890; R-307 to L-890; H-308 to L-890; P-309 to L-890; E-3 10 to L- 
890; H-3 1 1 to L-890; Y-3 1 2 to L-890; D-3 1 3 to L-890; T-3 1 4 to L-890; A-3 1 5 

20 to L-890; 1-3 1 6 to L-890; L-3 1 7 to L-890; L-3 1 8 to L-890; T-3 1 9 to L-890; R- 

320 to L-890; Q-321 to L-890; N-322 to L-890; F-323 to L-890; C-324 to L-890; 
G-325 to L-890; Q-326 to L-890; E-327 to L-890; G-328 to L-890; L-329 to L- 
890; C-330 to L-890; D-331 to L-890; T-332 to L-890; L-333 to L-890; G-334 
to L-890; V-335 to L-890; A-336 to L-890; D-337 to L-890; 1-338 to L-890; G- 

25 339 to L-890; T-340 to L-890; 1-341 to L-890; C-342 to L-890; D-343 to L-890; 

P-344 to L-890; N-345 to L-890; K-346 to L-890; S-347 to L-890; C-348 to L- 
890; S-349 to L-890; V-350 to L-890; 1-351 to L-890; E-352 to L-890; D-353 
to L-890; E-354 to L-890; G-355 to L-890; L-356 to L-890; Q-357 to L.890; A- 
358 to L-890; A-3 59 to L-890; H-360 to L-890; T-361 to L-890; L-3 62 to L-890; 

30 A-363 to L-890; H-364 to L-890; E-365 to L-890; L-366 to L-890; G-367 to L- 
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890; H-368 to L-890; V-369 to L-890; L-370 to L-890; S-371 to L-890; M-372 
to L-890; P-373 to L-890; H-374 to L-890; D-375 to L-890; D-376 to L-890; S- 
377 to L-890; K-378 to L-890; P-379 to L-890; C-380 to L-890; T-38 1 to L-890; 
R-382 to L-890; L-383 to L-890; F-384 to L-890; G-385 to L-890; P-386 to L- 
890; M-387 to L-890; G-388 to L-890; K-389 to L-890; H-390 to L-890; H-391 
to L-890; V-392 to L-890; M-393 to L-890; A-394 to L-890; P-395 to L-890; L- 
396 to L-890; F-397 to L-890; V-398 to L-890; H-399 to L-890; L-400 to L-890; 
N-401 to L-890; Q-402 to L-890; T-403 to L-890; L-404 to L-890; P-405 to L- 
890; W-406 to L-890; S-407 to L-890; P-408 to L-890; C-409 to L-890; S-410 
to L-890; A-4 11 to L-890; M-412 to L-890; Y-413 to L-890; L-414 to L-890; T- 
4 1 5 to L-890; E-4 1 6 to L-890; L-4 1 7 to L-890; L-4 1 8 to L-890; D-4 1 9 to L-890; 
G-420 to L-890; G-42 1 to L-890; H-422 to L-890; G-423 to L-890; D-424 to L- 
890; C-425 to L-890; L-426 to L-890; L-427 toL-890; D-428 to L-890; A-429 
to L-890; P-430 to L-890; G-43 1 to L-890; A-432 to L-890; A-433 to L-890; L- 
434 to L-890; P-435 to L-890; L-436 to L-890; P-437 to L-890; T-438 to L-890; 
G-43 9 to L-890; L-440 to L-890; P-441 to L-890; G-442 to L-890; R-443 to L- 
890; M-444 to L-890; A-445 to L-890; L-446 to L-890; Y-447 to L-890; Q-448 
to L-890; L-449 to L-890; D-450 to L-890; Q-45 1 to L-890; Q-452 to L-890; C- 
453 to L-890; R-454 to L-890; Q-455 to L-890; 1-456 to L-890; F-457 to L-890; 
G-458 to L-890; P-459 to L-890; D-460 to L-890; F-461 to L-890; R-462 to L- 
890; H-463 to L-890; C-464 to L-890; P-465 to L-890; N-466 to L-890; T-467 
to L-890; S-468 to L-890; A-469 to L-890; Q-470 to L-890; D-471 to L-890; V- 
472 to L-890; C-473 to L-890; A-474 to L-890; Q-475 to L-890; L-476 to L-890; 
W-477 to L-890; C-478 to L-890; H-479 to L-890; T-480 to L-890; D-48 1 to L- 
890; G-482 to L-890; A-483 to L-890; E-484 to L-890; P-485 to L-890; L-486 
to L-890; C-487 to L-890; H-488 to L-890; T-489 to L-890; K-490 to L-890; N- 
491 to L-890; G-492 to L-890; S-493 to L-890; L-494 to L-890; P-495 to L-890; 
W-496 to L-890; A-497 to L-890; D-498 to L-890; G-499 to L-890; T-500 to L- 
890; P-501 to L-890; C-502 to L-890; G-503 to L-890; P-504 to L-890; G-505 
to L-890; H-506 to L-890; L-507 to L-890; C-508 to L-890; S-509 to L-890; E- 
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5 1 0 to L-890; G-5 1 1 to L-890; S-5 1 2 to L-890; C-5 1 3 to L-890; L-5 ] 4 to L-890; 
P-5 1 5 to L-890; E-5 1 6 to L-890; E-5 1 7 to L-890; E-5 1 8 to L-890; V-5 ] 9 to L- 
890; E-520 to L-890; R-521 to L-890; P-522 to L-890; K-523 to L-890; P-524 
to L-890; V-525 to L-890; V-526 to L-890; D-527 to L-890; G-528 to L-890; G- 
529 to L-890; W-530 to L-890; A-53 1 to L-890; P-532 to L-890; W-533 to L- 
890; G-534 to L-890; P-535 to L-890; W-536 to L-890; G-537 to L-890; E-538 
to L-890; C-539 to L-890; S-540 to L-890; R-541 to L-890; T-542 to L-890; C- 
543 to L-890; G-544 to L-890; G-545 to L-890; G-546 to L-890; V-547 to L- 
890; Q-548 to L-890; F-549 to L-890; S-550 to L-890; H-551 to L-890; R-552 
to L-890; E-553 to L-890; C-554 to L-890; K-555 to L-890; D-556 to L-890; P- 
557 to L-890; E-558 to L-890; P-559 to L-890; Q-560 to L-890; N-561 to L-890; 
G-562 to L-890; G-563 to L-890; R-564 to L-890; Y-565 to L-890; C-566 to L- 
890; L-567 to L-890; G-568 to L-890; R-569 to L-890; R-570 to L-890; A-571 - 
to L-890; K-572 to L-890; Y-573 to L-890; Q-574to L-890; S-575 to L-890; C- 
576 to L-890; H-577 to L-890; T-578 to L-890; E-579 to L-890; E-580 to L-890; 
C-581 to L-890; P-582 to L-890; P-583 to L-890; D-584 to L-890; G-585 to L- 
890; K-586 to L-890; S-587 to L-890; F-588 to L-890; R-589 to L-890; E-590 
to L-890; Q-591 to L-890; Q-592 to L-890; C-593 to L-890; E-594 to L-890; K- 
595 to L-890; Y-596 to L-890; N-597 to L-890; A-598 to L-890; Y-599 to L- 
890; N-600 to L-890; Y-601 to L-890; T-602 to L-890; D-603 to L-890; M-604 
to L-890; D-605 to L-890; G-606 to L-890; N-607 to L-890; L-608 to L-890; L- 
609 to L-890; Q-610 to L-890; W-61 1 to L-890; V-612 to L-890; P-613 to L- 
890; K-614 to L-890; Y-6 1 5 to L-890; A-616 to L-890; G-617 to L-890; V-618 
to L-890; S-619 to L-890; P-620 to L-890; R-621 to L-890; D-622 to L-890; R- 
623 to L-890; C-624 to L-890; K-625 to L-890; L-626 to L-890; F-627 to L-890; 
C-628 to L-890; R-629 to L-890; A-630 to L-890; R-63 1 to L-890; G-632 to L- 
890; R-633 to L-890; S-634 to L-890; E-635 to L-890; F-636 to L-890; K-637 
to L-890; V-638 to L-890; F-639 to L-890; E-640 to L-890; A-641 to L-890; K- 
642 to L-890; V-643 to L-890; 1-644 to L-890; D-645 to L-890; G-646 to L-890; 
T-647 to L-890; L-648 to L-890; C-649 to L-890; G-650 to L-890; P-651 to L- 
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890; E-652 to L-890; T-653 to L-890; L-654 to L-890; A-655 to L-890; 1-656 to 
L-890; C-657 to L-890; V-658 to L-890; R-659 to L-890; G-660 to L-890; Q-661 
to L-890; C-662 to L-890; V-663 to L-890; K-664 to L-890; A-665 to L-890; G- 
666 to L-890; C-667 to L-890; D-668 to L-890; H-669 to L-890; V-670 to L- 
890; V-671 to L-890; D-672 to L-890; S-673 to L-890; P-674 to L-890; R-675 
to L-890; K-676 to L-890; L-677 to L-890; D-678 to L-890; K-679 to L-890; C- 
680 to L-890; G-681 to L-890; V-682 to L-890; C-683 to L-890; G-684 to L- 
890; G-685 to L-890; K-686 to L-890; G-687 to L-890; N-688 to L-890; S-689 
to L-890; C-690 to L-890; R-691 to L-890; K-692 to L-890; V-693 to L-890; S- 
694 to L-890; G-695 to L-890; S-696 to L-890; L-697 to L-890; T-698 to L-890; 
P-699 to L-890; T-700 to L-890; N-701 to L-890; Y-702 to L-890; G-703 to L- 
890; Y-704 to L-890; N-705 to L-890; D-706 to L-890; 1-707 to L-890; V-708 
to L-890; T-709 to L-890; 1-710 to L-890; P-71 1 to. L-890; A-712 to L-890; G- 
713 to L-890; A-714 to L-890; T-715 to L-890; N-716 to L-890; 1-717 to L-890; 
D-718 to L-890; V-719 to L-890; K-720 to L-890; Q-721 to L-890; R-722 to L- 
890; S-723 to L-890; H-724 to L-890; P-725 to L-890; G-726 to L-890; V-727 
to L-890; Q-728 to L-890; N-729 to L-890; D-730 to L-890; G-73 1 to L-890; N- 
732 to L-890; Y-733 to L-890; L-734 to L-890; A-735 to L-890; L-736 to L-890; 
K-737 to L-890; T-738 to L-890; A-739 to L-890; D-740 to L-890; G-741 to L- 
890; Q-742 to L-890; Y-743 to L-890; L-744 to L-890; L-745 to L-890; N-746 
to L-890; G-747 to L-890; N-748 to L-890; L-749 to L-890; A-750 to L-890; I- 
751 to L-890; S-752 to L-890; A-753 to L-890; 1-754 to L-890; E-755 to L-890; 
Q-756 to L-890; D-757 to L-890; 1-758 to L-890; L-759 to L-890; V-760 to L- 
890; K-761 to L-890; G-762 to L-890; T-763 to L-890; 1-764 to L-890; L-765 
to L-890; K-766to L-890; Y-767 to L-890; S-768 to L-890; G-769 to L-890; S- 
770 to L-890; 1-771 to L-890; A-772 to L-890; T-773 to L-890; L-774 to L-890; 
E-775 to L-890; R-776 to L-890; L-777 to L-890; Q-778 to L-890; S-779 to L- 
890; F-780 to L-890; R-78 1 to L-890; P-782 to L-890; L-783 to L-890; P-784 to 
L-890; E-785 to L-890; P-786 to L-890; L-787 to L-890; T-788 to L-890; V-789 
to L-890; Q-790 to L-890; L-791 to L-890; L-792 to L-890; T-793 to L-890; V- 
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794 to L-890; P-795 to L-890; G-796 to L-890; E-797 to L-890; V-798 to L-890; 
F-799 to L-890; P-800 to L-890; P-801 to L-890; K-802 to L-890; V-803 to L- 
890; K-804 to L-890; Y-805 to L-890, T-806 to L-890; F-807 to L-890; F-808 
to L-890; V-809 to L-890; P-8 10 to L-890; N-81 1 to L-890; D-812 to L-890; V- 
8 1 3 to L-890; D-8 14 to L-890; F-8 1 5 to L-890; S-8 1 6 to L-890; M-8 1 7 to L-890; 
Q-818 to L-890; S-8 19 to L-890; S-820 to L-890; K-821 to L-890; E-822 to L- 
890; R-823 to L-890; A-824 to L-890; T-825 to L-890; T-826 to L-890; N-827 
to L-890; 1-828 to L-890; 1-829 to L-890; Q-830 to L-890; P-83 1 to L-890; L- 
832 to L-890; L-833 to L-890; H-834 to L-890; A-835 to L-890; Q-836 to L-890; 
W-837 to L-890; V-838 to L-890; L-839 to L-890; G-840 to L-890; D-841 to L- 
890; W-842 to L-890; S-843 to L-890; E-844 to L-890; C-845 to L-890; S-846 
to L-890; S-847 to L-890; T-848 to L-890; C-849 to L-890; G-850 to L-890; A- 

- ■ 85 1 to L-890; G-852 to L-890; W-853 to L-890; Q-854 to L-890; R-855 to L 

890; R-856 to L-890; T-857 to L-890; V-858 to L-890; E-859 to L-890; C-860 
to L-890; R-861 to L-890; D-862 to L-890; P-863 to L-890; S-864 to L-890; G- 
865 to L-890; Q-866 to L-890; A-867 to L-890; S-868 to L-890; A-869 to L-890; 
T-870 to L-890; C-871 to L-890; N-872 to L-890; K-873 to L-890; A-874 to L- 
890; L-875 to L-890; K-876 to L-890; P-877 to L-890; E-878 to L-890; D-879 
to L-890; A-880 to L-890; K-881 to L-890; P-882 to L-890; C-883 to L-890; E- 
884 to L-890; S-885 to L-890; of SEQ ID NO:4. 

Moreover, C-terminal deletions of the METH2 polypeptide can also be 
described by the general formula 1-n, where n is an integer from 2 to 890 where 
n corresponds to the position of amino acid residue identified in SEQ ID NO:4. 
Preferably, C-terminal deletions of the METH2 polypeptide of the invention 
shown as SEQ ID NO:4 include polypeptides comprising the amino acid sequence 
of residues: M-l to P-889; M-l to C-888; M-l to L-887; M-l to Q-886; M-l to 
S-885; M-l to E-884; M-l to C-883; M-l to P-882; M-l to K-881; M-l to A- 
880; M-l to D-879; M-l to E-878; M-l to P-877; M-l to K-876; M-l to L-875; 
M-l to A-874; M-l to K-873; M-l to N-872; M-l to C-871; M-l to T-870; M-l 
to A-869; M-l to S-868; M-l to A-867; M-l to Q-866; M-l to G-865; M-l to S- 
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864; M-l toP-863;M-l toD-862; M-l toR-861;M-l toC-860;M-l to E-859; 
M-l to V-858; M-l to T-857; M-l to R-856; M-l to R-855; M-] to Q-854; M-l 
to W-853; M-l to G-852; M-l to A-851; M-l to G-850; M-l to C-849; M-l to 
. T-848;M-1 toS-847;M-l toS-846;M-l toC-845;M-l toE-844;M-l toS-843; 
5 M-l toW-842;M-l toD-841;M-l toG-840;M-l toL-839;M-l to V-838;M-1 

to W-837; M-l to Q-836; M-l to A-835; M-l to H-834; M-l to L-833; M-l to 
L-832;M-1 toP-831;M-l toQ-830;M-l to 1-829; M-l to 1-828; M-l toN-827; 
M-l to T-826; M-l to T-825; M-l to A-824; M-l to R-823; M-l to E-822; M-l 
toK-821;M-l to S-820; M-l to S-819;M-1 toQ-818;M-l to M-817;M-1 to S- 

10 816; M-l toF-815;M-l toD-814;M-l to V-813;M-1 toD-812;M-l toN-811; 

M-l to P-810; M-l to V-809; M-l to F-808; M-l to F-807; M-l to T-806; M-l 
to Y-805; M-l to K-804; M-l to V-803; M-l to K-802; M-l to P-801; M-l to P- 
800; M- 1 to F-799; M- 1 to V-798; M-l to E-797; M- r to G-796; M- 1 to P-795; 
M-l to V-794; M-l to T-793; M-l to L-792; M-l to L-791; M-l to Q-790; M-l 

15 to V-789; M-l to T-788; M-l to L-787; M-l to P-786; M-l to E-785; M-l to P- 

784; M-l to L-783; M-l to P-782; M-l to R-781; M-l to F-780; M-l to S-779; 
M-l to Q-778; M-l to L-777; M-l to R-776; M-l to E-775; M-l toL-774; M-I 
to T-773; M-l to A-772; M-l to 1-771; M-l to S-770; M-l to G-769; M-l to S- 
768; M-l to Y-767; M-l to K-766; M-l to L-765; M-l to 1-764; M-l to T-763; 

20 M-l to G-762; M-l to K-761; M-l to V-760; M-l to L-759; M-l to 1-758; M-l 

to D-757; M-l to Q-756; M-l to E-755; M-l to 1-754; M-l to A-753; M-l to S- 
752; M-l to 1-751; M-l to A-750; M-l to L-749; M-l to N-748; M-l to G-747; 
M-l to N-746; M-l to L-745; M-l to L-744; M-l to Y-743; M-l to Q-742; M-l 
to G-741; M-l to D-740; M-l to A-739; M-l to T-738; M-l to K-737; M-l to 

25 L-736; M-l to A-735; M-l to L-734; M-l to Y-733; M-l to N-732; M-l to G- 

73 1; M-l to D-730; M-l to N-729; M-l to Q-728; M-l to V-727; M-l to G-726; 
M-l to P-725; M-l to H-724; M-l to S-723; M-l to R-722; M-l to Q-721; M-l 
to K-720; M-l to V-719; M-l to D-718; M-l to 1-717; M-l to N-716; M-l to T- 
715; M-l to A-714;M-1 toG-713; M-l to A-712; M-l toP-711;M-l to 1-710; 

30 M-l to T-709; M-l to V-708; M-l to 1-707; M-l to D-706; M-l to N-705; M-l 
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to Y-704; M-l to G-703; M-l to Y-702; M-l to N-701; M-l to T-700; M-l to 
P-699; M-l toT-698;M-1 toL-697;M-l to S-696;M-1 toG-695;M-1 to S-694; 
M-l to V-693; M-l to K-692; M-l to R-691; M-l to C-690; M-l to S-689; M-l 
to N-688; M-l to G-687; M-l to K-686; M-l to G-685; M-l to G-684; M-l to 
5 C-683; M-l to V-682; M-l to G-681; M-l to C-680; M-l to K-679; M-l to D- 

678; M-l to L-677; M-l to K-676; M-l to R-675; M-l to P-674; M-l to S-673; 
M-l to D-672; M-l to V-671 ; M-l to V-670; M-l to H-669; M-l to D-668; M-l 
to C-667; M-l to G-666; M-l to A-665; M-l to K-664; M-l to V-663; M-l to 
C-662; M-l to Q-661; M-l to G-660; M-l to R-659; M-l to V-658; M-l to C- 

10 657; M-l to 1-656; M-l to A-655; M-l to L-654; M-l to T-653; M-l to E-652; 

M-l to P-651; M-l to G-650; M-l to C-649; M-l to L-648; M-l to T-647; M-l 
to G-646; M-l to D-645; M-l to 1-644; M-l to V-643; M-l to K-642; M-l to A- 
641; M-l toE-640;-M-l toF-639; M-l to V-638;M-1 toK-637;M-l to F-636;— 
M-l to E-635; M-l to S-634; M-l to R-633; M-l to G-632; M-l to R-63 1 ; M-l 

15 to A-630; M-l to R-629; M-l to C-628; M-l to F-627; M-l to L-626; M-l to K- 

625; M-l to C-624; M-l to R-623; M-l to D-622; M-l to R-621; M-l to P-620; 
M-l to S-619; M-l to V-618; M-l toG-617;M-l toA-616; M-l to Y-615;M-1 
to K-614; M-l to P-613; M-l to V-612; M-l to W-61 1; M-l to Q-610; M-l to 
L-609; M-l to L-608; M-l to N-607; M-l to G-606; M-l to D-605; M-l to M- 

20 604; M-l to D-603; M-l to T-602; M-l to Y-601; M-l toN-600; M-l to Y-599; 

M-l to A-598; M-l to N-597; M-l to Y-596; M-l to K-595; M-l to E-594; M-l 
to C-593; M-l to Q-592; M-l to Q-591; M-l to E-590; M-l to R-589; M-l to F- 
588; M-l to S-587; M-l to K-586; M-l to G-585; M-l to D-584; M-l to P-583; 
M-l to P-582; M-l to C-581; M-l to E-580; M-l to E-579; M-l to T-578; M-l 

25 to H-577; M-l to C-576; M-l to S-575; M-l to Q-574; M-l to Y-573; M-l to K- 

572; M-l to A-571; M-l to R-570; M-l to R-569; M-l to G-568; M-l to L-567; 
M-l to C-566; M-l to Y-565; M-l to R-564; M-l to G-563; M-l to G-562; M-l 
toN-561;M-l toQ-560;M-l toP-559;M-l toE-558;M-l toP-557;M-l toD- 
556; M-l toK-555;M-l to C-554; M-l toE-553;M-l toR-552; M-l toH-551; 

30 M-l to S-550; M-l to F-549; M-l to Q-548; M-l to V-547; M-l to G-546; M-l 
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toG-545;M-l toG-544;M-l toC-543;M-l toT-542; M-l toR-541;M-l to S- 
540; M-l to C-539; M-l to E-538; M-l to G-537; M-l to W-536; M-l toP-535; 
M-l toG-534;M-l toW-533;M-l toP-532;M-l to A-531;M-1 toW-530; M-l 
to G-529; M-l to G-528; M-l to D-527; M-l to V-526; M-l to V-525; M-l to 
P-524; M-l to K-523; M-l to P-522; M-l to R-521; M-l to E-520; M-l to V- 
519; M-l toE-518;M-l toE-517; M-l toE-516; M-l to P-515; M-l toL-514; 
M-l to C-513; M-l to S-512; M-l to G-511; M-l toE-510; M-l to S-509; M-l 
to C-508; M-l toL-507; M-l to H-506; M-l to G-505; M-l to P-504; M-l to G- 
503; M-l to C-502; M-l to P-501 ; M-l to T-500; M-l to G-499; M-l to D-498; 
M-l to A-497; M-l to W-496; M-l to P-495; M-l to L-494; M-l to S-493; M-l 
to G-492; M-l to N-491; M-l to K-490; M-l to T-489; M-l to H-488; M-l to 
C-487; M-l to L-486; M-l to P-485; M-l to E-484; M-l to A-483; M-l to G- 
482; M-l to D^l^M-l toT-480; M-l to H-479; M-l to C-478; M-_l_to W-47?; 
M-l to L-476; M-l to Q-475; M-l to A-474; M-l to C-473; M-l to V-472; M-l 
to D-471; M-l to Q-470; M-l to A-469; M-l to S-468; M-l to T-467; M-l to N- 
466; M-l to P-465; M-l to C-464; M-l to H-463; M-l to R-462; M-I to F-461; 
M-l to D-460; M-l to P-459; M-l to. G-458; M-l to F-457; M-l to 1-456; M-l 
to Q-455; M-l to R-454; M-l to C-453; M-l to Q-452; M-l to Q-451; M-l to 
D-450; M-l to L-449; M-l to Q-448; M-l to Y-447; M-l to L-446; M-l to A- 
445; M-l to M-444; M-l to R-443; M-l to G-442; M-l to P-441 ; M- 1 to L-440; 
M-l to G-439; M-l to T-438; M-l to P-437; M-l to L-436; M-l to P-435; M-l 
toL-434; M-l to A-433; M-l to A-432; M-l to G-43 1 ; M- 1 to P-43 0; M- 1 to A- 
429; M-l to D-428; M-l to L-427; M-l to L-426; M-l to C-425; M-l to D-424; 
M-l to G-423; M-l to H-422; M-l to G-421; M-l to G-420; M-l to D-419; M-l 
toL-418; M-l toL-417;M-l toE-416;M-l toT-415;M-l toL-414;M-l to Y- 
413; M-l to M-412; M-l to A-41 1; M-l to S-410; M-l to C-409; M-l to P-408; 
M-l to S-407; M-l to W-406; M-l to P-405; M-l to L-404; M-l to T-403; M-l 
to Q-402; M-l to N-401; M-l to L-400; M-l to H-399; M-l to V-398; M-l to 
F-397; M-l to L-396; M-l to P-395; M-l to A-394; M-l to M-393; M-l to V- 
392; M-l toH-391;M-l toH-390;M-l toK-389;M-l toG-388;M-l toM-387; 



WO 99AJ7660 



-89- 



PCT/US99/01313 



M-1 to P-386; M-l to G-385; M-l to F-384; M-l to L-383; M-l to R-382; M-l 
to T-381; M-l to C-380; M-l to P-379; M-l to K-378; M-l to S-377; M-l to D- 
376; M-l toD-375;M-l toH-374;M-l toP-373;M-l to M-372; M-l to S-371; 
M-l to L-370; M-l to V-369; M-l to H-368; M-l to G-367; M-l to L-366; M-l 
5 toE-365;M-l to H-364;M-1 to A-363; M-l toL-362; M-l to T-361; M-l toH- 

360; M-l to A-359; M-l to A-358; M-l to Q-357; M-l to L-356; M-l to G-355; 
M-l to E-354; M-l to D-353; M-l to E-352; M-l to 1-351; M-l to V-350; M-l 
to S-349, M-l to C-348; M-l to S-347; M-l to K-346; M-l to N-345; M-l to P- 
344; M-l to D-343; M-l to C-342; M-l to 1-341; M-l to T-340; M-l to G-339; 

10 M-l to 1-338; M-l to D-337; M-l to A-336; M-l to V-335; M-l to G-334; M-l 

toL-333;M-l to T-332;M-1 to D-331; M-l to C-330; M-l toL-329;.M-l toG- 
328; M-l to E-327; M-l to Q-326; M-l to G-325; M-l to C-324; M-I to F-323; 
- M-l toN-322;M-l toQ-321;M-l to R-320; M-1 to T-319; M-l toL-318;M-l- 
toL-317; M-l to 1-316; M-l to A-315;M-1 toT-314; M-l to D-313;M-1 to Y- 

15 312;M-1 toH-311;M-l toE-310;M-l toP-309;M-l toH-308;M-l toR-307; 

M-l toD-306;M-l to S-305;M-1 toP-304;M-l to Q-303; M-l toN-302; M-l 
to F-301; M-l to R-300; M-l to R-299; M-l to Q-298; M-l to W-297; M-l to 
N-296; M-l to C-295; M-l to F-294; M-l to N-293; M-l to R-292; M-l to L- 
291; M-l to T-290; M-l to L-289; M-l to G-288; M-l to G-287; M-l to N-286; 

20 M-l to D-285; M-l to S-284; M-l to V-283; M-l to E-282; M-l to P-281; M-l 

to G-280; M-l to W-279; M-l to K-278; M-l to E-277; M-l to D-276; M-l to 
E-275; M-l to V-274; M-l to 1-273; M-l to L-272; M-l to V-271; M-l to K- 
270; M-l to V-269; M-l to V-268; M-l to M-267; M-l to L-266; M-l to N-265; 
M-l to 1-264; M-l to S-263; M-l to N-262; M-l to K-261; M-l to 1-260; M-l 

25 to S-259; M-l to P-258; M-l to H-257; M-l to K-256; M-l to Y-255; M-l to I- 

254; M-l to R-253; M-l to A-252; M-l to A-251; M-l to V-250; M-l to S-249; 
M-l to M-248; M-l to L-247; M-l to T-246; M-l to L-245; M-l to 1-244; M-l 
to H-243; M-l to N-242; M-l to Q-241; M-l to L-240; M-l to D-239; M-l to 
A-238; M-l to G-237; M-l to Y-236; M-l to F-235; M-l to A-234; M-l to A- 

30 233; M-l to M-232; M-l to S-23 1; M-l to A-230; M-l to D-229; M-l to A-228; 
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M-l to V-227; M-l to L-226; M-l to L-225; M-l to T-224; M-l to E-223; M-l 
to V-222; M- 1 to F-22 1; M-l to R-220; M- 1 to A-2 1 9; M- 1 to E-2 1 8 ; M- 1 to S- 
217; M-l to V-216;M-1 toF-215;M-l toR-214;M-l toK-213;M-l toT-212; 
M-l toR-211;M-l to S-210; M-l to T-209; M-l to A-208; M-l to G-207; M-l 
5 to L-206; M- 1 to P-205; M- 1 to P-204; M- 1 to P-203 ; M- 1 to P-202; M- 1 to E- 

201; M-l to S-200; M-l toA-199; M-l to G-198; M-l to E-197; M-l toA-196; 
M-l toE-195;M-l toE-194; M-l toE-193;M-l to Q-192; M-l toS-191;M-l 
toE-190; M-l to E-189; M-l toE-188;M-l to S-187; M-l to D-186;M-1 to E- 
185;M-1 toQ-184;M-l toH-183;M-l toD-182;M-l toG-181,M-l toR-180; 
10 M-l to E-179; M-l to Q-178; M-l to R-177; M-l to Q-176; M-l to G-175; M-l 

toE-174;M-l toG-173;M-l toT-172;M-l toE-171;M-l to V-170;M-1 toE- 
169; M-l toW-168;M-l toE-167;M-l toP-166;M-l toG-165;M-l toR-164; 
M- 1 to P- 1 63 ; M- 1 to L-.1 62; M-l to P- 1 6 1 ; M- 1 to R- 1 60; M- 1 to. A- 1 59; M- 1 
to G-158; M-l to A-157; M-l to P-156; M-l to G-155; M-l to W-154; M-l to 

15 R-153; M-l to Q-152; M-l to L-151; M-l to R-150; M-l to H-149; M-l to P- 

148; M-l to Q-147; M-l to A-146; M-l to L-145; M-l to S-144; M-l to G-143; 
M-l to G-142; M-l to A-141; M-l to G-140; M-l to Q-139; M-l to P-138; M-l 
to Q-137; M-l to 1-136; M-l to T-135; M-l to F-134; M-l to E-133; M-l to E- 
132; M-l toG-131;M-l toD-130;M-l toL-129;M-l toL-128; M-l to F-127; 

20 M-l to S-126; M-l to G-125; M-l to S-124; M-l to L-123; M-l to G-122; M-l 

to R-121; M-l to C-120; M-l toL-119; M-l to S-l 18; M-l to V-l 17; M-l to A- 
116; M-l to A-115;M-1 toL-114; M-l to S-l 13; M-l toE-112;M-l toP-111; 
M-l toE-110; M-l toG-109;M-l toN-108; M-l to V-107; M-l toT-106;M-l 
to G-105; M-l to S-l 04; M-l to F- 103; M-l to F-102; M-l to C-l 01; M-l to G- 

25 100; M-l to R-99; M-l to L-98; M-l to G-97; M-l to R-96; M-l to E-95; M-l 

to G-94; M-l to G-93; M-l to T-92; M-l to A- 91; M-l to R-90; M-l to G-89; 
M-l to S-88; M-l to G-87; M-l to G-86; M-l to L-85; M-l to R-84; M-l to E- 
83; M-l to 1-82; M-l to K-81; M-l to F-80; M-l to E-79; M-l to P-78; M-l to 
A-77; M-l to L-76; M-l to F-75; M-l to S-74; M-l to D-73; M-l to D-72; M-l 

30 to P-7 1 ; M- 1 to A-70; M- 1 to L-69; M-l to R-68; M- 1 to L-67; M- 1 to V-66; M- 
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1 to F-65; M-l to G-64; M-l to K-63; M-l to G-62; M-l to F-61; M-l to A-60; 
M-l to S-59; M-l to L-58;M-1 toH-57;M-l toL-56; M-l toA-55;M-l to L- 
54; M-l toE-53; M-l to G-52; M-l to A-51; M-l to S-50; M-l to G-49; M-l to 
P-48; M-l to L-47; M-l to R-46; M-l to T-45; M-l to P-44; M-l to V-43; M-l 
5 to V-42; M-l to L-41 ; M-l to E-40; M-l to S-39; M-l to A-38; M- 1 to Q-37; M- 

1 to G-36; M-l to G-35; M-l toA-34; M-l to A-33;M-1 toP-32;M-l toR-31; 
M-l to A-30; M-l to P-29; M-l to A-28; M-l to G-27; M-l to R-26; M-l to A- 
25; M-l to L-24; M-l to P-23; M-l to L-22; M-l to L-21; M-l to L-20; M-l to 
L-19;M-1 toL-18;M-l toL-17;M-l toL-16;M-3 toL-15;M-l to L-14; M-l 
10 toF-13; M-l toP-12;M-l toL-ll;M-l toW-10;M-l toR-9;M-l toP-8;M-l 

to A-7; of SEQ ID NO:4. Preferably, any of the above listed N- or C-terminal 
deletions can be combined to produce a N- and C-terminal deleted METH2 

polypeptide. - _ 

The invention also provides polypeptides having one or more amino acids 
15 deleted from both the amino and the carboxyl termini, which may be described 

generally as having residues m-n of SEQ ED NO:2 or SEQ ID NO:4, where n and 
m are integers as described above. 

Also preferred are METH1 or METH2 polypeptide and polynucleotide 
fragments characterized by structural or functional domains. Preferred 
20 embodiments of the invention include fragments that comprise alpha-helix and 

alpha-helix forming regions ("alpha-regions"), beta-sheet and beta-sheet-forming 
regions ("beta-regions"), turn and turn-forming regions ("turn-regions"), coil and 
coil-forming regions ("coil-regions"), hydrophilic regions, hydrophobic regions, 
alpha amphipathic regions, beta amphipathic regions, flexible regions, surface- 
25 forming regions* substrate binding region, and high antigenic index regions. As 

set out in the Figures, such preferred regions include Garnier-Robson alpha- 
regions, beta-regions, turn-regions, and coil-regions, Chou-Fasman alpha-regions, 
beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions and 
hydrophobic regions, Eisenberg alpha and beta amphipathic regions, Karplus- 
30 Schulz flexible regions, Emini surface-forming regions, and Jameson-Wolf high 
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antigenic index regions. Polypeptide fragments of SEQ ID NO:2 falling within 
conserved domains are specifically contemplated by the present invention. (See 
Figures 1 0 & 1 1 and Tables 1& 2.) Moreover, polynucleotide fragments encoding 
these domains are also contemplated. 

Other preferred fragments are biologically active METH 1 or METH2 
fragments. Biologically active fragments are those exhibiting activity similar, but 
not necessarily identical, to an activity of the METH1 or METH2 polypeptide. 
The biological activity of the fragments may include an improved desired activity, 
or a decreased undesirable activity. 

However, many polynucleotide sequences, such as EST sequences, are 
publicly available and accessible through sequence databases. Some of these 
sequences are related to SEQ ID NO:l or SEQ ID NO:3 and may have been 
publicly available prior to conception of the present invention. Preferably, such 
related polynucleotides are specifically excluded from the scope of the present 
invention. To list every related sequence would be cumbersome. Accordingly, 
preferably excluded from the present invention are one or more polynucleotides 
comprising a nucleotide sequence described by the general formula of a-b, where 
a is any integer between 1 to 936 of SEQ ED NO: 1, b is an integer of 1 5 to 950, 
where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ED NO:l, and where the b is greater than or equal to a + 14. Moreover, 
preferably excluded from the present invention are one or more polynucleotides 
comprising a nucleotide sequence described by the general formula of a-b, where 
a is any integer between 1 to 876 of SEQ ID NO:3, b is an integer of 15 to 890, 
where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:3, and where the b is greater than or equal to a + 14. 

Epitopes & Antibodies , 

In the present invention, "epitopes" refer to METH1 or METH2 
polypeptide fragments having antigenic or immunogenic activity in an animal, 
especially in a human. A preferred embodiment of the present invention relates 
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to a METH1 or METH2 polypeptide fragment comprising an epitope, as well as 
the polynucleotide encoding this fragment. A region of a protein molecule to 
which an antibody can bind is defined as an "antigenic epitope." In contrast, an 
"immunogenic epitope" is defined as a part of a protein that elicits an antibody 
5 response. (See, for instance, Geysen et al y Proc. Nail. Acad. Sci. USA 57:3998- 

4002(1983).) 

Fragments which function as epitopes may be produced by any 
conventional means. (See, e.g., Houghten, R. A., Proc. Nail Acad. Sci. USA 
52:5131-5135 (1985) further described in U.S. Patent No. 4,631,211.) 

10 In the present invention, antigenic epitopes preferably contain a sequence 

of at least seven, more preferably at least nine, and most preferably between about 
15 to about 30 amino acids. Antigenic epitopes are useful to raise antibodies, 
including monoclonal antibodies, that specifically bind the epitope. (See, for 
instance, Wilson et al, Cell 37:767-778 (1984); Sutcliffe, J. G. et al y Science 

15 279:660-666(1983).) 

Similarly, immunogenic epitopes can be used to induce antibodies 
according to methods well known in the art. (See, for instance, Sutcliffe et al y 
supra; Wilson etal, supra; Chow, M. etal, Proc. Nail Acad Sci. USA 52:910- 
914; and Bittle, F. J. etal, J. Gen. Virol 66:2347-2354 (1985).) A preferred 

20 immunogenic epitope includes the secreted protein. The immunogenic epitopes 

may be presented together with a carrier protein, such as an albumin, to an animal 
system (such as rabbit or mouse) or, if it is long enough (at least about 25 amino 
acids), without a carrier. However, immunogenic epitopes comprising as few as 
8 to 10 amino acids have been shown to be sufficient to raise antibodies capable 

25 of binding to, at the very least, linear epitopes in a denatured polypeptide (e.g., in 

Western blotting.) 

Using DNAstar analysis, SEQ ID NO: 2 was found antigenic at amino 
acids: 2-14,32-44,47-60,66-78,87-103, 109-118, 146-162, 168-180, 183-219, 
223-243,275-284,296-306,314-334,341-354,357-376,392-399, 401-410,418- 

30 429,438-454,456-471,474-488, 510-522, 524-538, 550-561, 565-626,630-643, 
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659-67 1 , 679-72 1 , 734-749, 784-804, 8 1 3-820, 825-832, 845-854, 860-894, 899- 
917, 919-924 and 928-939. Thus, these regions could be used as epitopes to 
produce antibodies against the protein encoded by METH1 cDNA. 

Using DNAstar analysis, SEQ ID NO:4 was found antigenic at amino 
acids: 26-38, 45-52, 69-76, 80-99, 105-113, 129-136, 138-217, 254-263, 273- 
289,294-313,321-331,339-356,371-383,417-427,438-443,459-471,479-505, 
507-526, 535-546, 550-607, 61 5-640, 648-653, 660-667, 669-681, 683-704, 71 7- 
732, 737-743, 775-787, 797-804, 811-825, 840-867 and 870-884. Thus, these 
regions could be used as epitopes to produce antibodies against the protein 
encoded by METH2 cDNA. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) 
is meant to include intact molecules as well as antibody fragments (such as, for 
example, Fab and F(ab')2 fragments) which are.capable of specifically binding to 
protein. Fab and F(ab')2 fragments lack the Fc fragment of intact antibody, clear 
more rapidly from the circulation, and may have less non-specific tissue binding 
than an intact antibody. (Wahl et al, J. NucL Med 24:3 16-325 (1983).) Thus, 
these fragments are preferred, as well as the products of a FAB or other 
immunoglobulin expression library. Moreover, antibodies of the present invention 
include chimeric, single chain, and humanized antibodies. 

Fusion Proteins 

Any METH1 or METH2 polypeptide can be used to generate fusion 
proteins. For example, the METH1 or METH2 polypeptide, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
METH1 or METH2 polypeptide can be used to indirectly detect the second 
protein by binding to the METH1 or METH2. Moreover, because secreted 
proteins target cellular locations based on trafficking signals, the METH1 or 
METH2 polypeptides can be used as a targeting molecule once fused to other 
proteins. 
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Examples of domains that can be fused to METH1 or METH2 
polypeptides include not only heterologous signal sequences, but also other 
heterologous functional regions. The fusion does not necessarily need to be 
direct, but may occur through linker sequences. 
5 Moreover, fusion proteins may also be engineered to improve 

characteristics of the METH1 or METH2 polypeptide. For instance, a region of 
additional amino acids, particularly charged amino acids, may be added to the N- 
terminus of the METH1 or METH2 polypeptide to improve stability and 
persistence during purification from the host cell or subsequent handling and 
10 storage. Also, peptide moieties may be added to the METH1 or METH2 

polypeptide to facilitate purification. Such regions may be removed prior to final 
preparation of the METH1 or METH2 polypeptide. The addition of peptide 
moieties to facilitate~handling ofpolypeptides^arefamiliar and routine techniques 
in the art. 

15 Moreover, METH1 or METH2 polypeptides, including fragments, and 

specifically epitopes, can be combined with parts of the constant domain of 
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. One reported 
example describes chimeric proteins consisting of the first two domains of the 

20 human CD4-polypeptide and various domains of the constant regions of the heavy 

or light chains of mammalian immunoglobulins. (EP A 394,827; Traunecker et 
al. y Nature 337:84-86 (1988).) Fusion proteins having disulfide-linked dimeric 
structures (due to the IgG) can also be more efficient in binding and neutralizing 
other molecules, than the monomelic secreted protein or protein fragment alone. 

25 (Fountoulakis et al., J. Biochem. 270:3958-3964 (1995).) 

Similarly, EP-A-O 464 533 (Canadian counterpart 2045869) discloses 
fusion proteins comprising various portions of constant region of immunoglobulin 
molecules together with another human protein or part thereof. In many cases, 
the Fc part in a fusion protein is beneficial in therapy and diagnosis, and thus can 

30 result in, for example, improved pharmacokinetic properties. (EP-A 0232 262.) 
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Alternatively, deleting the Fc part after the fusion protein has been expressed, 
detected, and purified, would be desired. For example, the Fc portion may hinder 
therapy and diagnosis if the fusion protein is used as an antigen for immunizations. 
In drug discovery, for example, human proteins, such as hIL-5, have been fused 
with Fc portions for the purpose of high-throughput screening assays to identify 
antagonists of hIL-5. (See, D. Bennett et al., J. Molecular Recognition 8:52-58 
(1995); K. Johanson et al., J. Biol. Chem. 270:9459-9471 (1995).) 

Moreover, the METH1 or METH2 polypeptides can be fused to marker 
sequences, such as a peptide which facilitates purification of METH1 or METH2. 
In preferred embodiments, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (Q1AGEN, Inc., 9259 Eton 
Avenue, Chatsworth, CA, 91311), among others, many of which are commercially 
available. As described in Gentz et a!., Proc. Natl Acad. Sci. USA 56^821-824 
(1989), for instance, hexa-histidine provides for convenient purification of the 
fusion protein. Another peptide tag useful for purification, the "HA" tag, 
corresponds to an epitope derived from the influenza hemagglutinin protein. 
(Wilson et a!., Cell 37:767 (1984).) 

Thus, any of these above fusions can be engineered using the METH1 or 
METH2 polynucleotides or the polypeptides. 

Biological Activities of METH1 or METH2 

METH1 or METH2 polynucleotides and polypeptides can be used in 
assays to test for one or more biological activities. If METH1 or METH2 
polynucleotides and polypeptides do exhibit activity in a particular assay, it is 
likely that METH1 or METH2 may be involved in the diseases associated with the 
biological activity. Therefore, METH1 or METH2 could be used to treat the 
associated disease. 
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Immune Activity 

METH1 or METH2 polypeptides or polynucleotides may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting 
the proliferation, differentiation, or mobilization (chemotaxis) of immune cells. 
Immune cells develop through a process called hematopoiesis, producing myeloid 
(platelets, red blood cells, neutrophils, and macrophages) and lymphoid (B and T 
lymphocytes) cells from pluripotent stem cells. The etiology of these immune 
deficiencies or disorders may be genetic, somatic, such as cancer or some 
autoimmune disorders, acquired (e.g., by chemotherapy or toxins), or infectious. 
Moreover, METH1 or METH2 polynucleotides or polypeptides can be used as a 
marker or detector of a particular immune system disease or disorder. 

METH1 or METH2 polynucleotides, or polypeptides may be useful in 
treating oT r det"ecting deficiencies or disorders of hematopoietic cells. METH1 or 
METH2 polypeptides or polynucleotides could be used to increase differentiation 
and proliferation of hematopoietic cells, including the pluripotent stem cells, in an 
effort to treat those disorders associated with a decrease in certain (or many) types 
hematopoietic cells. Examples of immunologic deficiency syndromes include, but 
are not limited to: blood protein disorders (e.g. agammaglobulinemia, 
dysgammaglobulinemia), ataxia telangiectasia, common variable 
immunodeficiency, Digeorge Syndrome, HIV infection, HTLV-BLV infection, 
leukocyte adhesion deficiency syndrome, lymphopenia, phagocyte bactericidal 
dysfunction, severe combined immunodeficiency (SCIDs), Wiskott-Aldrich 
Disorder, anemia, thrombocytopenia, or hemoglobinuria. 

Moreover, METH1 or METH2 polypeptides or polynucleotides can also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity 
(clot formation). For example, by increasing hemostatic or thrombolytic activity, 
METH1 or METH2 polynucleotides or polypeptides could be used to treat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 
disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or 
other causes. Alternatively, METH 1 or METH2 polynucleotides or polypeptides 
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that can decrease hemostatic or thrombolytic activity could be used to inhibit or 
dissolve clotting, important in the treatment of heart attacks (infarction), strokes, 
or scarring. 

METH1 or METH2 polynucleotides or polypeptides may also be useful 
in treating or detecting autoimmune disorders. Many autoimmune disorders result 
from inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction 
of the host tissue. Therefore, the administration of METH1 or METH2 
polypeptides or polynucleotides that can inhibit an immune response, particularly 
the proliferation, differentiation, or chemotaxis of T-cells, may be an effective 
therapy in preventing autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by 
— METH1 or METH2 include, but are not limited to: Addison's Pisease,-hemolytic— 
anemia, antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic 
encephalomyelitis, glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, 
Multiple Sclerosis, Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, 
Pemphigus, Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, 
Autoimmune Thyroiditis, Systemic Lupus Erythematosus, Autoimmune 
Pulmonary Inflammation, Guillain-Barre Syndrome, insulin dependent diabetes 
mellitis, and autoimmune inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly 
allergic asthma) or other respiratory problems, may also be treated by METH1 or 
METH2 polypeptides or polynucleotides. Moreover, METH 1 or METH2 can be 
used to treat anaphylaxis, hypersensitivity to an antigenic molecule, or blood 
group incompatibility. 

METH1 orMETH2 polynucleotides or polypeptides may also be used to 
treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through 
an immune response. Similarly, an immune response is also involved in GVHD; 
but, in this case, the foreign transplanted immune cells destroy the host tissues. 
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The administration of METH1 or METH2 polypeptides or polynucleotides that 
inhibits an immune response, particularly the proliferation, differentiation, or 
chemotaxis of T-cells, may be an effective therapy in preventing organ rejection 
or GVHD. 

5 Similarly, METH1 or METH2 polypeptides or polynucleotides may also 

be used to modulate inflammation. For example, METH1 or METH2 
polypeptides or polynucleotides may inhibit the proliferation and differentiation of 
cells involved in an inflammatory response. These molecules can be used to treat 
inflammatory conditions, both chronic and acute conditions, including 
10 inflammation associated with infection (e.g., septic shock, sepsis, or systemic 

inflammatory response syndrome (SIRS)), ischemia-reperfiision injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine 
or chemokine induced lung injury, inflammatory bowerdisease, Crohn's disease, 
or resulting from over production of cytokines (e.g., TNF or IL-1 .) 



15 Hyperproliferative Disorders 

METH1 or METH2 polypeptides or polynucleotides can be used to treat 
or detect hyperproliferative disorders, including neoplasms. METH1 or METH2 
polypeptides or polynucleotides may inhibit the proliferation of the disorder 
through direct or indirect interactions. Alternatively, METH1 or METH2 

20 polypeptides or polynucleotides may proliferate other cells which can inhibit the 

hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, 
differentiating, or mobilizing T-cells, hyperproliferative disorders can be treated. 

25 This immune response may be increased by either enhancing an existing immune 

response, or by initiating a new immune response. Alternatively, decreasing an 
immune response may also be a method of treating hyperproliferative disorders, 
such as a chemotherapeutic agent. 
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Examples of hyperproliferative disorders that can be treated or detected 
by METH1 or METH2 polynucleotides or polypeptides include, but are not 
limited to neoplasms located in the: abdomen, bone, breast, digestive system, liver, 
pancreas, peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, 
ovary, thymus, thyroid), eye, head and neck, nervous (central and peripheral), 
lymphatic system, pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected 
by METH1 or METH2 polynucleotides or polypeptides. Examples of such 
hyperproliferative disorders include, but are not limited to: 
hypergammaglobulinemia, lymphoproliferative disorders, paraproteinemias, 
purpura, sarcoidosis, Sezary Syndrome, Waldenstron's Macroglobulinemia, 
Gaucher ! s Disease, histiocytosis, and any other hyperproliferative disease, besides 
neoplasia^ located -in an organ system .listed above. .. . 

Infectious Disease 

METH1 or METH2 polypeptides or polynucleotides can be used to treat 
or detect infectious agents. For example, by increasing the immune response, 
particularly increasing the proliferation and differentiation of B and/or T cells, 
infectious diseases may be treated. The immune response may be increased by 
either enhancing an existing immune response, or by initiating a new immune 
response. Alternatively, METH1 or METH2 polypeptides or polynucleotides may 
also directly inhibit the infectious agent, without necessarily eliciting an immune 
response. 

Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by METH 1 or METH2 polynucleotides 
or polypeptides. Examples of viruses, include, but are not limited to the following 
DN A and RN A viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, 
Bimaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, 
Flaviviridae, Hepadnaviridae (Hepatitis), Herpesviridae; (such as, 
Cytomegalovirus, Herpes Simplex, Herpes Zoster), Mononegavirus (e.g., 
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Paramyxoviridae, Morbillivirus, Rhabdoviridae), Orthomyxoviridae (e.g., 
Influenza), Papovaviridae, Parvoviridae, Picornaviridae, Poxviridae (such as 
Smallpox or Vaccinia), Reoviridae (e.g., Rotavirus), Retroviridae (HTLV-I, 
HTLV-II, Lentivinis), and Togaviridae (e.g., Rubivirus). Viruses falling within 
these families can cause a variety of diseases or symptoms, including, but not 
limited to: arthritis, bronchiolitis, encephalitis, eye infections (e.g., conjunctivitis, 
keratitis), chronic fatigue syndrome, hepatitis (A, B, C, E, Chronic Active, Delta), 
meningitis, opportunistic infections (e.g., AIDS), pneumonia, Burkitt's 
Lymphoma, chickenpox , hemorrhagic fever, Measles, Mumps, Parainfluenza, 
Rabies, the common cold, Polio, leukemia, Rubella, sexually transmitted diseases, 
skin diseases (e.g., Kaposi's, warts), and viremia. METH1 or METH2 
polypeptides or polynucleotides can be used to treat or detect any of these 
" symptoms or diseases. ' — ~ 

Similarly, bacterial or fungal agents that can cause disease or symptoms 
and that can be treated or detected by METH1 or METH2 polynucleotides or 
polypeptides include, but not limited to, the following Gram-Negative and Gram- 
positive bacterial families and fungi: Actinomycetales (e.g., Corynebacterium, 
Mycobacterium, Norcardia), Aspergillosis, Bacillaceae (e.g., Anthrax, 
Clostridium), Bacteroidaceae, Blastomycosis, Bordetella, Borrelia, Brucellosis, 
Candidiasis, Campylobacter, Coccidioidomycosis, Cryptococcosis, 
Dermatocycoses, Enterobacteriaceae (Klebsiella, Salmonella, Serratia, Yersinia), 
Erysipelothrix, Helicobacter, Legionellosis, Leptospirosis, Listeria, 
Mycoplasmatales, Neisseriaceae (e.g., Acinetobacter, Gonorrhea, Menigococcal), 
Pasteurellacea Infections (e.g., Actinobacillus, Heamophilus, Pasteurella), 
Pseudomonas, Rickettsiaceae, Chlamydiaceae, Syphilis, and Staphylococcal. 
These bacterial or fungal families can cause the following diseases or symptoms, 
including, but not limited to: bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., 
AIDS related infections), paronychia, prosthesis-related infections, Reiter's 
Disease, respiratory tract infections, such as Whooping Cough or Empyema, 
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sepsis, Lyme Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, food 
poisoning, Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, 
Diphtheria, Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, 
tetanus, impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, 
5 skin diseases (e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, 

wound infections. METH1 or METH2 polypeptides or polynucleotides can be 
used to treat or detect any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be 
treated or detected by METHI or METH2 polynucleotides or polypeptides 

10 include, but not limited to, the following families: Amebiasis, Babesiosis, 

Coccidiosis, Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, 
Giardiasis, Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, 

.^Trypanosomiasis, and Trichomonas.. These parasites can cause a variety „o£ . 

diseases or symptoms, including, but not limited to: Scabies, Trombiculiasis, eye 

15 infections, intestinal disease (e.g., dysentery, giardiasis), liver disease, lung disease, 

opportunistic infections (e.g., AIDS related), Malaria, pregnancy complications, 
and toxoplasmosis. METH1 or METH2 polypeptides or polynucleotides can be 
used to treat or detect any of these symptoms or diseases. 

Preferably, treatment using METH1 or METH2 polypeptides or 

20 polynucleotides could either be by administering an effective amount of METH1 

or METH2 polypeptide to the patient, or by removing cells from the patient, 
supplying the cells with METH1 or METH2 polynucleotide, and returning the 
engineered cells to the patient (ex vivo therapy). Moreover, the METHI or 
METH2 polypeptide or polynucleotide can be used as an antigen in a vaccine to 

25 raise an immune response against infectious disease. 

Regeneration 

METHI or METH2 polynucleotides or polypeptides can be used to 
differentiate, proliferate, and attract cells, leading to the regeneration of tissues. 
(See, Science 276:59-87 (1997).) The regeneration of tissues could be used to 
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repair, replace, or protect tissue damaged by congenital defects, trauma (wounds, 
burns, incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, 
periodontal disease, liver failure), surgery, including cosmetic plastic surgery, 
fibrosis, reperfusion injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include 
organs (e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, 
skeletal or cardiac), vascular (including vascular endothelium), nervous, 
hematopoietic, and skeletal (bone, cartilage, tendon, and ligament) tissue. 
Preferably, regeneration occurs without or decreased scarring. Regeneration also 
may include angiogenesis. 

Moreover, METH1 or METH2 polynucleotides or polypeptides may 
increase regeneration of tissues difficult to heal. For example, increased 
tendon/ligament regeneration would qiiickeri recovery time^fter^amage7I^TH 1 
or METH2 polynucleotides or polypeptides of the present invention could also be 
used prophylactically in an effort to avoid damage. Specific diseases that could 
be treated include of tendinitis, carpal tunnel syndrome, and other tendon or 
ligament defects. A further example of tissue regeneration of non-healing wounds 
includes pressure ulcers, ulcers associated with vascular insufficiency, surgical, and 
traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using 
METH1 or METH2 polynucleotides or polypeptides to proliferate and 
differentiate nerve cells. Diseases that could be treated using this method include 
central and peripheral nervous system diseases, neuropathies, or mechanical and 
traumatic disorders (e.g., spinal cord disorders, head trauma, cerebrovascular 
disease, and stoke). Specifically, diseases associated with peripheral nerve 
injuries, peripheral neuropathy (e.g. , resulting from chemotherapy or other medical 
therapies), localized neuropathies, and central nervous system diseases (e.g., 
Alzheimer's disease, Parkinson's disease, Huntington's disease, amyotrophic lateral 
sclerosis, and Shy-Drager syndrome), could all be treated using the METH1 or 
METH2 polynucleotides or polypeptides. 
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Chemotaxis 

METH 1 or METH2 polynucleotides or polypeptides may have chemotaxis 
activity. A chemotaxic molecule attracts or mobilizes cells (e.g., monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or 
endothelial cells) to a particular site in the body, such as inflammation, infection, 
or site of hyperproliferation. The mobilized cells can then fight off and/or heal the 
particular trauma or abnormality. 

METH1 or METH2 polynucleotides or polypeptides may increase 
chemotaxic activity of particular cells. These chemotactic molecules can then be 
used to treat inflammation, infection, hyperproliferative disorders, or any immune 
system disorder by increasing the number of cells targeted to a particular location 
in the body. For example, chemotaxic molecules can be used to treat wounds and 

other trauma-to_tissues.by„attracting.irnmune cells to„the injured location. As a„ 

chemotactic molecule, METH1 or METH2 could also attract fibroblasts, which 
can be used to treat wounds. 

It is also contemplated that METH1 or METH2 polynucleotides or 
polypeptides may inhibit chemotactic activity. These molecules could also be used 
to treat disorders. Thus, METH1 or METH2 polynucleotides or polypeptides 
could be used as an inhibitor of chemotaxis. 

Binding Activity 

METH 1 or METH2 polypeptides may be used to screen for molecules that 
bind to METH1 or METH2 or for molecules to which METH1 or METH2 binds. 
The binding of METH 1 or METH2 and the molecule may activate (agonist), 
increase, inhibit (antagonist), or decrease activity of the METH] or METH2 or 
the molecule bound. Examples of such molecules include antibodies, 
oligonucleotides, proteins (e.g., receptors),or small molecules. 

Preferably, the molecule is closely related to the natural ligand of METH 1 
or METH2, e.g., a fragment of the ligand, or a natural substrate, a ligand, a 
structural or functional mimetic. (See, Coligan et a/., Current Protocols in 
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Immunology / (2) .Chapter 5 (1991).) Similarly, the molecule can be closely 
related to the natural receptor to which METH1 or METH2 binds, or at least, a 
fragment of the receptor capable of being bound by METH1 or METH2 (e.g., 
active site). In either case, the molecule can be rationally designed using known 
techniques. 

Preferably, the screening for these molecules involves producing 
appropriate cells which express METH1 or METH2, either as a secreted protein 
or on the cell membrane. Preferred cells include cells from mammals, yeast, 
Drosophila, or E. colL Cells expressing METH1 or METH2(or cell membrane 
containing the expressed polypeptide) are then preferably contacted with a test 
compound potentially containing the molecule to observe binding, stimulation, or 
inhibition of activity of either METH1 or METH2 or the molecule. 

" "The assay may simply test binding of a candidate compound toMETH 1 or 
METH2, wherein binding is detected by a label, or in an assay involving 
competition with a labeled competitor. Further, the assay may test whether the 
candidate compound results in a signal generated by binding to METH1 or 
METH2. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural 
product mixtures. The assay may also simply comprise the steps of mixing a 
candidate compound with a solution containing METH1 or METH2, measuring 
METH1 or METH2/molecule activity or binding, and comparing the METH1 or 
METH2/moIecule activity or binding to a standard. 

Preferably, an ELISA assay can measure METH1 or METH2 level or 
activity in a sample (e.g., biological sample) using a monoclonal or polyclonal 
antibody. The antibody can measure METH1 or METH2 level or activity by 
either binding, directly or indirectly, to METH1 or METH2 or by competing with 
METH1 or METH2 for a substrate. 

All of these above assays can be used as diagnostic or prognostic markers. 
The molecules discovered using these assays can be used to treat disease or to 
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bring about a particular result in a patient (e.g., blood vessel growth) by activating 
or inhibiting the METH1 or METH2 molecule. Moreover, the assays can discover 
agents which may inhibit or enhance the production of METH1 or METH2 from 
suitably manipulated cells or tissues. 

Therefore, the invention includes a method of identifying compounds 
which bind to METH1 or METH2 comprising the steps of: (a) incubating a 
candidate binding compound with METH1 or METH2; and (b) determining if 
binding has occurred. Moreover, the invention includes a method of identifying 
agonists/antagonists comprising the steps of: (a) incubating a candidate 
compound with METH1 or METH2, (b) assaying a biological activity , and (b) 
determining if a biological activity of METH1 or METH2 has been altered. 

Qther„A£tivities _.. . . . . .._ 

METH1 or METH2 polypeptides or polynucleotides may also increase or 
decrease the differentiation or proliferation of embryonic stem cells, besides, as 
discussed above, hematopoietic lineage. 

METH1 or METH2 polypeptides or polynucleotides may also be used to 
modulate mammalian characteristics, such as body height, weight, hair color, eye 
color, skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., 
cosmetic surgery). Similarly, METH1 orMETH2 polypeptides or polynucleotides 
may be used to modulate mammalian metabolism affecting catabolism, anabolism, 
processing, utilization, and storage of energy. 

METH1 or METH2 polypeptides or polynucleotides may be used to 
change a mammal's mental state or physical state by influencing biorhythms, 
circadian rhythms, depression (including depressive disorders), tendency for 
violence, tolerance for pain, reproductive capabilities (preferably by Activin or 
Inhibin-Iike activity), hormonal or endocrine levels, appetite, libido, memory, 
stress, or other cognitive qualities. 

METH1 or METH2 polypeptides or polynucleotides may also be used as 
a food additive or preservative, such as to increase or decrease storage 
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capabilities, fat content, lipid, protein, carbohydrate, vitamins, minerals, cofactors 
or other nutritional components. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Cancer Diagnosis and Prognosis 

It is believed that certain tissues in mammals with cancer express 
significantly diminished levels of the METH1 or METH2 protein and mRNA 
encoding the METH1 or METH2 protein when compared to a corresponding 
"standard" mammal, i.e., a mammal of the same species not having the cancer. 
- Further, it is believed that diminished levels of the METH 1 or METH2 proteincan 
be detected in certain body fluids (e.g., sera, plasma, urine, and spinal fluid) from 
mammals with cancer when compared to sera from mammals of the same species 
not having the cancer. Thus, the invention provides a diagnostic method useful 
during tumor diagnosis, which involves assaying the expression level of the gene 
encoding the METH1 protein in mammalian cells or body fluid and comparing the 
gene expression level with a standard METH1 gene expression level, whereby a 
decrease in the gene expression level under the standard is indicative of certain 
tumors. The invention also provides a diagnostic method useful during tumor 
diagnosis, which involves assaying the expression level of the gene encoding the 
METH2 protein in mammalian cells or body fluid and comparing the gene 
expression level with a standard METH2 gene expression level, whereby a 
decrease in the gene expression level under the standard is indicative of certain 
tumors. 

Where a tumor diagnosis has already been made according to conventional 
methods, the present invention is useful as a prognostic indicator, whereby 
patients exhibiting diminished METH 1 or METH2 gene expression will experience 
a worse clinical outcome relative to patients expressing the gene at a lower level. 
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By "assaying the expression level of the gene encoding the METH1 or 
METH2 protein" is intended qualitatively or quantitatively measuring or 
estimating the level of the METH1 or METH2 protein or the level of the mRNA 
encoding the METH 1 or METH2 protein in a first biological sample either directly 
(e.g., by determining or estimating absolute protein level or mRNA level) or 
relatively (e.g., by comparing to the METH1 or METH2 protein level or mRNA 
level in a second biological sample). 

Preferably, the METH1 or METH2 protein level or mRNA level in the 
first biological sample is measured or estimated and compared to a standard 
METH1 or METH2 protein level or mRNA level, the standard being taken from 
a second biological sample obtained from an individual not having the cancer. As 
will be appreciated in the art, once a standard METH1 or METH2 protein level 
or mRNA level is known, it can-be used repeatedly as a standard for comparison. 

By "biological sample" is intended any biological sample obtained from an 
individual, cell line, tissue culture, or other source which contains METH1 or 
METH2 protein or mRNA. Biological samples include mammalian body fluids 
(such as sera, plasma, urine, synovial fluid and spinal fluid) which contain secreted 
mature METH1 or METH2 protein, and adrenal, thyroid, stomach, brain, heart, 
placenta, lung, liver, muscle, kidney, pancreas, testis and ovarian tissue (for 
METH1); and prostate, small intestine, colon, brain and lung tissue (for METH2). 

The present invention is useful for detecting cancer in mammals. In 
particular the invention is useful during diagnosis of the of following types of 
cancers in mammals: breast, ovarian, prostate, liver, lung, pancreatic, colon, and 
testicular. Preferred mammals include monkeys, apes, cats, dogs, cows, pigs, 
horses, rabbits and humans. Particularly preferred are humans. 

Total cellular RNA can be isolated from a biological sample using the 
single-step guanidinium-thiocyanate-phenol-chloroform method described in 
Chomczynski and Sacchi, Anal. Biochem. 162: 156-159(1 987). Levels of mRNA 
encoding the METH1 or METH2 protein are then assayed using any appropriate 
method. These include Northern blot analysis (Harada et al f Cell 53:303-3 12 
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(1990)), SI nuclease mapping (Fujita et al, Cell 49:351- 367 (1987)), the 
polymerase chain reaction (PCR), reverse transcription in combination with the 
polymerase chain reaction (RT-PCR) (Makino et al, Technique 2:295-301 
(1990)), and reverse transcription in combination with the ligase chain reaction 
(RT-LCR). 

Assaying METH1 or METH2 protein levels in a biological sample can 
occur using antibody-based techniques. For example, METH 1 or METH2 protein 
expression in tissues can be studied with classical immunohistological methods 
(Jalkanen, M., et al, J. Cell Biol 707:976-985 (1985); Jalkanen, M., et al t J. 
Cell . Biol 705:3087-3096 (1987)). 

Other antibody-based methods useful for detecting METH1 or METH2 
protein gene expression include immunoassays, such as the enzyme linked 
immunosorbent assay (EEISA)"ahd~tHe radioimmunoassay (RIA). 

Suitable labels are known in the art and include enzyme labels, such as, 
glucose oxidase, and radioisotopes, such as iodine ( l25 I, ,2, I), carbon ( I4 C), sulfur 
( 35 S), tritium ( 3 H), indium ( I,2 In), and technetium fTc), and fluorescent labels, 
such as fluorescein and rhodamine, and biotin. 

Modes of administration 

It is recognized than an increase in the vascular supply plays a central role 
in tumor progression and metastasis; therefore, inhibitors of angiogenesis can 
prove effective as adjuvant therapy for cancer patients. Some of the currently 
recognized angiogenic suppressors are poor candidates for systemic treatment due 
to severe collateral effect. The present inventors have found that METH1 and 
METH2 are potent inhibitors of angiogenesis both in vitro and in vivo. The 
advantage of METH 1 and METH1 is that these inhibitors are normally associated 
with suppression of physiological angiogenesis; therefore, they offer lack of 
toxicity and endothelial specificity over other angiogenic inhibitors. Furthermore, 
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METH1 and METH2 present a restricted pattern of expression providing a 
possible advantage on organ specificity. 

Accordingly, the polypeptides of the present invention may be employed 
to treat cancer. The METH1 and METH2 polypeptides of the present invention 
can also be used to treat individuals with other disorders that are related to 
angiogenesis, including abnormal wound healing, inflammation, rheumatoid 
arthritis, psoriasis, endometrial bleeding disorders, diabetic retinopathy, some 
forms of macula degeneration, hemangiomas, and arterial-venous malformations. 

Thus, the invention provides a method of inhibiting angiogenesis in an 
individual comprising administering to such an individual a pharmaceutical 
composition comprising an effective amount of an isolated METH1 polypeptide 
of the invention, effective to increase the METH1 activity level in such an 
individual. The invention also provides a method of inhibiting angiogenesis in an 
individual comprising administering to such an individual a pharmaceutical 
composition comprising an effective amount of an isolated METH2 polypeptide 
of the invention, effective to increase the METH2 activity level in such an 
individual. 

METH1 polypeptides which may be used to inhibit angiogenesis in this 
manner include: METH1 polypeptide encoded by the deposited cDNA including 
the leader; the mature METH 1 polypeptide encoded by the deposited the cDNA 
minus the leader (i.e., the mature protein); a polypeptide comprising amino acids 
about 1 to about 950 in SEQ ID NO:2; a polypeptide comprising amino acids 
about 2 to about 950 in SEQ ED NO:2; a polypeptide comprising amino acids 
about 29 to about 950 in SEQ ID NO:2; a polypeptide comprising amino acids 
about 30 to about 950 in SEQ ID NO:2; a polypeptide comprising the 
metalloprotease domain of METH1, amino acids 235 to 459 in SEQ ID NO:2; a 
polypeptide comprising the disintegrin domain of METH 1 , amino acids 460 to 544 
in SEQ ID NO:2; a polypeptide comprising the first TSP-like domain of METH 1 , 
amino acids 545 to 598 in SEQ ID NO:2; a polypeptide comprising the second 
TSP-like domain of METH 1, amino acids 841 to 894 in SEQ ID NO:2; a 
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polypeptide comprising the third TSP-like domain of METH1, amino acids 895 
to 934 in SEQ ID NO:2; a polypeptide comprising amino acids 536 to 6 1 3 in SEQ 
ID NO:2; and a polypeptide comprising amino acids 549 to 563 in SEQ ED NO:2. 

METH2 polypeptides which may be used to inhibit angiogenesis in this 
manner include: the METH2 polypeptide encoded by the deposited cDNA 
including the leader; the mature METH2 polypeptide encoded by the deposited the 
cDNA minus the leader (i.e., the mature protein); a polypeptide comprising amino 
acids about 1 to about 890 in SEQ ID NO:4; a polypeptide comprising amino 
acids about 2 to about 890 in SEQ ID NO:4; a polypeptide comprising amino 
acids about 24 to about 890 in SEQ ID NO:4; a polypeptide comprising amino 
acids about 112 to about 890 in SEQ ID NO:4; a polypeptide comprising the* 
metalloprotease domain of METH2, amino acids 214 to 439 in SEQ ID NO:4; a 
polypeptide comprising the disintegriri domain of METH2,"amino acids~440 to 529' 
in SEQ ID NO:4; a polypeptide comprising the first TSP-like domain of METH2, 
amino acids 530 to 583 in SEQ ID NO:4; a polypeptide comprising the second 
TSP-like domain of METH2, amino acids 837 to 890 in SEQ ID NO:4; a 
polypeptide comprising amino acids 280 to 606 in SEQ ID NO:4; and a 
polypeptide comprising amino acids 529 to 548 in SEQ ID NO:4. 

As a general proposition, the total pharmaceutical^ effective amount of 
METH1 or METH2 polypeptide administered parenterally per dose will be in the 
range of about 1 pg/kg/day to 10 mg/kg/day of patient body weight, although, as 
noted above, this will be subject to therapeutic discretion. More preferably, this 
dose is at least 0.01 mg/kg/day, and most preferably for humans between about 
0.01 and 1 mg/kg/day for the polypeptide. If given continuously, the METH1 or 
METH2 polypeptide is typically administered at a dose rate of about 1 jig/kg/hour 
to about 50 pg/kg/hour, either by 1-4 injections per day or by continuous 
subcutaneous infusions, for example, using a mini-pump. An intravenous bag 
solution may also be employed. 

Pharmaceutical compositions containing the METH1 or METH2 of the 
invention may be administered orally, rectally, parenterally, intracistemally, 
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intravaginally, intraperitoneal^, topically (as by powders, ointments, drops or 
transdermal patch), bucally, or as an oral or nasal spray. By "pharmaceutically 
acceptable carrier" is meant a non-toxic solid, semisolid or liquid filler, diluent, 
encapsulating material or formulation auxiliary of any type. The term "parenteral" 
5 as used herein refers to modes of administration which include intravenous, 

intramuscular, intraperitoneal, intrasternal, subcutaneous and intraarticular 
injection and infusion. 

Chromosome Assays 

The nucleic acid molecules of the present invention are also valuable for 
10 chromosome identification. The sequence is specifically targeted to and can 

hybridize with a particular location _on an individual-human, chromosome. The 
mapping of DNAs to chromosomes according to the present invention is an 
important first step in correlating those sequences with genes associated with 
disease. 

15 In certain preferred embodiments in this regard, the cDNAherein disclosed 

is used to clone genomic DNA of a METH1 or METH2 protein gene. This can 
be accomplished using a variety of well known techniques and libraries, which 
generally are available commercially. The genomic DNA then is used for in situ 
chromosome mapping using well known techniques for this purpose. 

20 In addition, in some cases, sequences can be mapped to chromosomes by 

preparing PCR primers (preferably 1 5-25 bp) from the cDNA. Computer analysis 
of the 3' untranslated region of the gene is used to rapidly select primers that do 
not span more than one exon in the genomic DNA, thus complicating the 
amplification process. These primers are then used for PCR screening of somatic 

25 cell hybrids containing individual human chromosomes. 

Fluorescence in situ hybridization ("FISH") of a cDNA clone to a 
metaphase chromosomal spread can be used to provide a precise chromosomal 
location in one step. This technique can be used with probes from the cDNA as 
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short as 50 or 60 bp. For a review of this technique, see Verma et al % Human 
Chromosomes: A Manual Of Basic Techniques, Pergamon Press, New York 
(1988). 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with 
genetic map data. Such data are found, for example, in V. McKusick, Mendelian 
Inheritance In Man, available on-line through Johns Hopkins University, Welch 
Medical Library. The relationship between genes and diseases that have been 
mapped to the same chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes). 

Next, it is necessary to determine the differences in the cDNA or genomic 
sequence between affected and unaffected individuals. If a mutation is observed 
ih~sbme"or all of theaffected individuals but notin'any normal individuals, then the 
mutation is likely to be the causative agent of the disease. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Examples 

Example 1; Identification and cloning of METHI and METH2 

To search for novel genes with TSP-like domains, a large human cDNA 
database consisting of approximately 900,00 expressed sequence tags (ESTs) was 
screened for sequences homologous to the second type I repeat of TSP 1 . Several 
ESTs were predicted to encode proteins with TSP-like domains. Two cDNA 
clones originated from human heart and lung libraries were further sequenced and 
chosen for functional analysis. 
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The amino-terminal end of METH1 was obtained using 5' rapid 
amplification of cDNA ends (RACE) PCR technique (Marathon cDNA 
amplification kit, Clontech) according to manufacturer instructions. The amino- 
terminal end of METH2 was obtained partially through 5'RACE PCR and later 
confirmed and completed by genomic screening. For the genomic screen, BAC 
clones (Genome Systems) were initially identified by PCR. Positive BAC clones 
containing 150-200bp of sequence were subsequently subcloned into pGEM 
vector as small fragments and sequenced. 

Analysis and comparison of the deduced amino acid sequence with the 
GenBank, EMBL and SwissProt databases suggested that these genes belong to 
a new family of metalloproteases with homology to the reprolysin family in their 
NH2 -terminal end and with several TSP-like motifs in the COOH-terminal end. 
-These cDNAs-were named METH1 and METH2; ME, for metalloprotease and 
TH, for thrombospondin. The mouse homologue of METH1 was identified and 
named ADAMTS1 (Kuno, K., eta!., J. Biol Chem. 272:556-562 (1 997)). Direct 
comparison of the human and mouse sequences revealed a high level of 
conservation (83.4% amino acid identity). Thus far no homologues for METH2 
have been identified. 

Interestingly, a recently identified protein named pNPI (procollagen 1 N- 
proteinase; (Colidge, A., etal, Proc. Nail Acad. ScL USA 94:2314-2219 (1997)) 
showed a striking sequence and structural similarity to METH1 and METH2 
(Figure 3). As the novel proteins described here, pNPI also contains 
metalloproteinase (reprolysin subfamily) and TSP domains at the carboxy-terminal 
end. Although the sequence for pNPI is of bovine origin, sequence alignment 
revealed identical structural features. The amino acid similarity between METH 1 
and METH2 is 5 1 .7%, and between METH1 or METH2 and pNPI the homology 
is lesser 33.9% and 36.3%, respectively. 

Sequence analysis showed that the ORF of METH1 and METH2 coded 
for proteins of 950 and 890 amino acids, respectively. In all three proteins, the 
NH 2 terminal end contains a putative signal peptide followed by another putative 
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transmembrane domain around amino acid 300, deduced from the hydrophilicity 
plots. It is not clear whether these proteins are bound to the membrane. 
However, given preliminary data, it is more likely that this second transmembrane 
domain will consist of a hydrophobic pocket and that METH1, METH2 and pNPI 
are in fact secreted proteins. The NH 2 -terminal end past the signal peptide has 
homology to the superfamily of zinc metalloproteases and can be subdivided in a 
prodomain, a metalloprotease domain, and a cysteine-rich region. 

The double underlined sequence in METH1 and METH2 in Figure 3 
localized at the boundary between the prodomain and the metalloprotease domain, 
are potential cleavage sites for mammalian subtilisins, such as furins (Barr, 1 99 1 ). 
Proteolytical processing occurs in SVMPs to yield soluble metalloproteases and 
disintegrins (Bjarnason, J.B. & Fox, J.W., Methods Enzymol. 245:345-368 

— (1 995)) and has also been detected in some ADAMs (reviewed by Wolsberg, TrGr— 

& White, J.M, Developmental Biology 750:389-401 (1996)). At this point, 
preliminary experiments suggest that proteolytical processing occurs, at least in 
METH1. Additionally, both METH1 and METH2 present a Zn 2+ -binding site 
(dotted line in Figure 3) that is presumed to be catalytically active due to the 
conservation of certain functionally important amino acids (Rawlings, N.D. & 
Barrett, A.J., Methods Enzymol 248: 183-228 (1995)) suggesting that these 
proteins may be active proteases. Following the metalloprotease domain, there 
is a cysteine-rich region which contains two putative disintegrin loops (Wolsberg, 
T.G. & White, J.M, Developmental Biology 750:389-401 (1996)) (marked by 
arrows in Figure 3). Disintegrin domains are found within the superfamily of 
metalloproteases in snake venom metalloproteases (SVMPs) and ADAMs 
(mammalian proteins containing a disintegrin and a metalloprotease domain) and 
have a possible function inhibiting binding of integrins to their ligands in SVMPs. 
Conversely, the ADAM-disintegrin-like domain, as part of membrane anchored 
proteins, may promote rather than disrupt, cell-cell interactions (Wolsberg, T.G. 
& White, J.M., Developmental Biology 750:389*401 (1996)). The TSP-like 
domains are located in the COOH-half of METH1 and METH2 proteins. METHl 
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contains two conserved TSP domains separated by a spacer region with unknown 
function, and a subdomain with less homology, and only 5 cysteines, following the 
second anti-angiogenic region. METH2 contains two TSP domains separated by 
the spacer region. The alignment of the TSP-Iike domains of METH1 and 
METH2 with those of TSP 1 and TSP2 are shown in Figure 5. The homology 
varies between 19.2% to 52% amino acid similarity among all the TSP repeats. 
The cysteines, numbered 1 to 6, and the tryptophans, labeled by asterisks, are 
highly conserved. 

Southern blot of human genomic DNA revealed the presence of METH1 
and METH2 in the genome. METH1 and METH2 probes revealed bands of 
different size suggesting that they are transcribed from different genes. 

The consensus sequence for the type I repeats includes 1 6 residues with 
6 perfectly conserved [.cysteines. ...Typically it begins. ^ith_the_sequence motif 
WSXWS (SEQ ID NO: 82) that has also been shown to bind to heparin (Guo, N., 
etal, J, Biol. Chem. 267:19349-19355 (1992)). The affinity of this region to 
heparin has been proposed to the part of the anti-angiogenic activity of TSP- 1 
(Guo, N., et al , J. Peptide Res. 49 ( 1 997)). Among the five members of the TSP 
family of proteins, only TSP- 1 and TSP-2 inhibit angiogenesis and contain the type 
I repeats (Tolsma, S.S., et at. t J. Cell Biol 722:497-511 (1993); Kyriakides, 
T.R., et oL. J. Cell Biol 740:419-430 (1998)). The type I or properdin repeats 
were probably added to the precursor of TSP 1 and 2 by exon shuffling between 
500 and 900 years ago (Adams, J., et al, The Thrombospondin Gene Family \ 1 
Ed. Molecular Biology Intelligence Unit (Springer, Ed.), R.G. Landes Company, 
Germany (1995)). It is likely that the acquisition of this domain provided the 
precursor of TSP 1 and TSP2 with functions, such as regulation of new vessel 
formation. More recently, B AI- 1 (brain angiogenic inhibitor- 1 ), a protein isolated 
from a brain library for its ability to be regulated by p53, has also been shown to 
contain the type I repeat of TSP- 1 and to provide anti-angiogenic potential to this 
molecule (Nishimori, H., etal t Oncogene 75:2145-2150 (1997)). Nevertheless, 
it appears that additional sequences or context are also important, since other 
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proteins containing the type I repeats appear not to have clear or more established 
anti-angiogenic properties such as: properdin, F-spondin, and other members of 

the complement family. 

Because of the presence of TSP-repeats in METHl and METH2, along 
with their anti-angiogenic properties, these proteins were originally considered 
membersoftheTSP superfamily. Nevertheless, they haveno additional homology 
to other TSPs, and in fact, the similarity to TSP1 and TSP2 is restricted to the 
type I repeats. Furthermore, the proteins also have strong sequence and structural 
homology to members of the ADAM family. These features led Kuno and 
colleagues to name ADAMTS to the mouse homolog of METHl (Kuno, K., et 
al. J. Biol. Chem. 272.556-562 (1997)). The recent identification of pNPl and 
its striking sequence homology to theprotdns here described, prompt all these 
three proteins to be grouped in a subfamily named metallospondins. At this point, 
it is not clear whether pNIP has anti-angiogenic properties or whether METHl 
and/or METH2 participate in the cleavage of the amino terminal pro-peptide of 
a 1(1) procollagen. 

Example 2: Northern and Southern blot analysis 

Total RNA was purified from cells by guanidinium-isothiocyanate 
extraction, as previously described (Chomczynski, P. & Sacchi, R, Anal 
Biochem. 162 A56-159 (1987)) Poly(A>+RNA was extracted using a Boehnnger 
Mannheim (BMB, Indianapolis, IN) kit according to the manufacturer conditions. 
Other poly(A)+RNA blots were purchased from Clontech (Palo Alto, CA). Pre- 
hybridization was performed in a solution containing: 50% formamide, 6X SSPE, 
IXDenhardfs solution, 0. 1% SDS and lOOug/ml of heat denatured salmon sperm 
DNA for 12-18h at 42°C. Hybridization with labeled cDNA probes proceeded 
inthesamesolutionat42°Cfor 12-18h. TSP1 and METHl probes corresponded 
to the entire human cDNAs. METH2 probe corresponded to a Kpnl-EcoRl 
fragment from the human cDN A. A 1.3Kb ^fragment of the glyceraldehyde-3- 
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phosphate-dehydrogenase (GPDH) was used to normalize for loading and transfer 
efficiency. Membranes were exposed to Kodak Biomax MS film (Kodak, New 
Haven, CT). 

For Southern blots, human genomic DNA, purchased from Promega 
5 (Madison, WI), was heated at 65°C for 1 0 min and digested with EcoKi and Pstl 

overnight at 37 °C. 5ug of digested DNA was separated in a 1% agarose gel, 
transferred to a nytran membrane and cross-linked by ultraviolet light. cDNA 
probes, as well as, prehybridization and hybridization conditions were identical to 
those described for Northern blots. Blots were washed with high stringency (0.2X 

10 SSC, 0.2% SDS at 50 °C). 

The expression pattern of METH1 and METH2 was examined in both 
adult and embryonic tissues. Northern blot analysis was performed under high- 

- • ■ ' stringency conditicJns with blots that included poly(A)+RNA from human tissues. - 

METH1 and METH2 transcripts revealed a single band of 4.6 and 3.7Kb, 

15 respectively. Abundant METH1 mRNA expression was observed in adrenal, 

heart, placenta, followed by skeletal muscle, thyroid and stomach. From the 
embryonic tissues analyzed, kidney showed the highest expression of METH1 
mRNA. Nevertheless, weaker expression of METH1 mRNA was seen in all 
tissues analyzed. Distribution of METH2 mRNA was more restricted and weaker 

20 than that of METH1 . The highest expression was seen in lung, both embryonic 

and adult. Interestingly, METH1 and METH2 expression do not appear to 
overlap. In combination, the structural similarities and their pattern of expression 
suggest functional redundancy yet different transcriptional regulation. The 
expression levels of TSP1 transcripts in the same blots were also analyzed, for 

25 purpose of comparison. TSP 1 mRNA highest expression was seen in the adult 

placenta and in all embryonic tissues analyzed. In contrast to METH1 and 
METH2 we observed constant levels of TSP 1 transcript in all the other tissues 
examined. 

The cell type distribution was also studied by Northern blot analysis of 
30 poly(A)+RNA. METH1 mRNA was detectable, at low levels, in dermal 
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fibroblasts, vascular smooth muscle, endometrial stromal cells, and in two cancer 
cell lines, HeLa and G631, an adenocarcinoma and a melanoma, respectively. 
METH2 mRNA was detected only on SW480, a colon carcinoma cell line, but no 
expression was seen in any other of the cell lines or primary strains analyzed. 
5 The possibility that groups of angiogenic and anti-angiogenic factors 

regulate vascular network formation in specific organs has been a frequently 
discussed hypothesis likely to be true, yet unproven. The expression patterns of 
METH1 and METH2, which are clearly distinct and almost non-overlapping, were 
puzzling, at least with concern to overall levels. TSP1 and TSP2 also share 
10 identical structure, high level of amino acid similarity, yet their pattern of 

expression differs significantly (Iruela-Arispe, M.L.Dev. Dyn. 7P7.40-56 (1993)). 
The differences are likely based on dissimilar cis-acting elements in their 
promoters and different regulatory mechanisms, as~ previously suggested. ~ 
Although the promoters for METH1 and 2 have not been characterized, it is likely 
15 that they provide unique features for the regulation of each gene. Nevertheless, 

the possibility that one motif, the anti-angiogenic / type I repeat, with 
demonstrated anti-angiogenic properties is present in several proteins with 
different tissue specificities is appealing. Alternatively, the small differences in 
sequence between closely related members of the same family could possess 
20 significance that goes beyond functional redundancy. In the case of TSP1 and 

TSP2, aside from the striking structural similarities and perhaps having 
functionally common anti-angiogenic properties, TSP1 and TSP2 also appear to 
display functions of their own and not likely shared by their similar relative. This 
became evident with the outcome of the two knock-outs for these genes. TSP 1 
25 null animals exhibited primarily lung disorders (Lawler, J., et al, J. Clin. Invest. 

707:982-992 (1998)) and secondarily vascular abnormalities, but only under 
specific pathological settings or on a restricted set of organs. In contrast TSP2 
knock-out mice exhibited unpredicted collagen assembly anomalies, with carry-on 
consequences to the skin, tendons, and bone (Kyriakides, T.R. , et al. , J. Cell Biol. 
30 740.419-430 (1998)). In addition, these animals also appear to have overall 
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increase in capillary density in the dermis. It is not understood how the 
resemblance between the newly described members of the metallospondin family 
translate functionally. Clearly, pNIP has been shown to display active proteolytic 
activity by cleaving the N-terminus of type 1 procollagen (Colidge, A. , et al. . Proc. 
Natl. Acad. Sci. USA 9*2374-2379 (1997)). 

A second region of functional interest corresponds to the disintegrin 
domain. This domain has been more fully characterized in related members of the 
snake venom metalloproteases that have been shown to bind to ccllbp3 and inhibit 
platelet interaction blocking coagulation (Pfaff, M.. et al.. Cell Adhes Commun. 
2:491-501 (1994); Usami, Y., etai. Biochem. Biophys. Res. Commun. 207:331- 
339 (1 994)). The disintegrin motif consists of a thirteen to fifteen domain which 
frequently contain an RGD or a negatively charged residue at the position of the 
" aspartic acid. The RGD,^>r equivalent, binds to integrins and serve as antagonist 
or signaling ligands (Wolsberg, T.G. & White, J.M., Developmental Biology 
750:389-401 (1996)). METH2, but not METH1, has an RGD sequence located 
amino-terminal to the disintegrin domain. In addition, both molecules present 
relatively high, but not perfect, degree of conservation of cysteines within the 
disintegrin motif. This appears to display an important role in the tertiary 
structure of this region and its ability to interact with integrins. In addition, some 
of these domains have been shown to act as functional adhesion molecules, 
particularly those with transmembrane regions (Wolsberg, T.G. & White, J.M., 
Developmental Biology 750:389-401 (1996)). It is unlikely that this will be the 
case for METH1 and METH2, since both these proteins appear to be secreted. 

Example 3:Jjcpression and purification of recombinant proteins 

Recombinant constructs for expression of truncated fusion proteins were 
as follows: (1) pRSET-METHl -Type I: METH1 nt 1605-1839 (from the start 
codon) was amplified by polymerase chain reaction using the following primers: 
5 '-GCA TTT TGG ATC CGC CTT TTC ATG-3 ' (SEQ ID NO:78) and 5 '-GTT 
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GTG TGC TGC AGA TTG TTC C-3 ' (SEQID NO:79). The amplified fragment 
was then subcloned into the BamHl and Pstl sites of the pRSET vector; (2) 
pGEX-METHl-TSP was generated by ligating the BamHl-EcoRl fragment from 
the pRSET-METH 1 -TSP into the Smal site of the pGEX-5X vector (Pharmacia 
Biotech Inc., Piscataway, NJ) by blunt-end ligation; (3) pGEX-1 .0-METH2: the 
fragment nt 83 8- 1 8 1 8 of METH2 cDN A (from the start codon) was ligated into 
Bamm-EcoKl sites of pGEM-2TK. The METH2 fragment was amplified by PCR 
using the following primers: 5 '-GAAAAATGGGGATCCGAGGTG-3 ' (SEQID 
NO.80) and 5 '-GC AGGAGAATTCCGTCC ATG-3 ' (SEQDDNO.81) to generate 
BamHl and EcoRl restriction sites; (4) pGEX-METH2-TSP: a 0.5Kb Xma\- 
EcoRl fragment isolated from pGEX-1.0-METH2 was subcloned into theXmal 
and EcoRl sites of pGEX-2TK vector. All constructs were sequenced to verify 
sequence fidelity and correct open reading frame. 

The recombinant proteins were named 6H-METH1, the recombinant 
protein expressed with the plasmid pRSET-METHl-TSP, GST-METH1, the 
protein expressed with the plasmid pGEX-METHl-TSP and GST-METH2, the 
protein expressed with the plasmid pGEX-METH2-TSP. 

Expression plasmids were transformed into BL21:DE3 E. coli strain 
(Stratagene Cloning Systems, La Jolla, CA) and fusion proteins were induced 
following manufacturer recommendations. Briefly, induced bacteria pellets were 
resuspended in PBS and sonicated on ice for 1 min. The suspension was, 
subsequently, incubated at RT for 20min in the presence of 1% triton X-100 and 
centrifuged at 4°C. Histidine tagged fusion proteins were then purified on Ni- 
NT A beads (Qiagen, Chatsworth, C A) by incubating 20ml of supernatant with 1 ml 
of beads (50% slurry) for 2h at 4°C. The suspension was transferred into a 
column and washed with 1 0 columns volume of PBS containing 1 OmM imidazole, 
followed by 50mM imidazole and finally lOOmM imidazole. The protein was 
eluted with 500mM imidazole in PBS. Fractions containing the recombinant 
protein were dialyzed against phenol-red free DMEM. Samples were centrifuged 
for 30min at 4°C, part of the protein was not soluble and was lost during 
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centrifugation. The supernatant was stored at -70°C and used for proliferation, 
cornea pocket and chorioallantoic membrane (CAM) assays. 

For purification of GST-fusion proteins, the extract was cleared by 
centrifugation and applied to a GST-affinity column (Pharmacia). The column 
5 was washed with PBS-1% triton X-100 in the presence of 0.1 mM reduced 

glutathione and, subsequently, with the same buffer in the presence of O.SmM 
reduced glutathione. Fusion proteins were eluted with 1 OmM reduced glutathione 
in 50mM Tris-HCl, pH 7.5 . Fractions containing the protein were dialyzed against 
DMEM, stored at -70°C and used for proliferation, cornea pocket and 
10 chorioallantoic membrane (CAM) assays. 

Integrity and purity of recombinant proteins was analyzed in 12.5% or 
1 5% acrylamide gels stained with Coomassie blue. 

A recombinahTGST fusion protein containing the first two type I repeats 
of TSP was also dialyzed against DMEM before used in functional assays. Intact 
TSP1 was purified from platelets as previously described (Roberts, D.D., et al, 
J. Tissue Cult. Methods 16:217-222 (1994)). 

To test the hypothesis that METH1 and METH2 TSP domains could 
functionas regulators ofangiogenesis recombinant fusion proteins were generated 
in bacteria. The constructs included the first TSP domain of METH1 or METH2. 
This domain is the most conserved, 52% amino acid similarity with the second 
type I repeat of TSP 1, (this domain contains a putative binding site for CD36). 
All recombinant proteins were isolated under native conditions to preserve their 
secondarystructureas much as possible. 6H-METH1 and GST-METH1 contained 
the first TSP-like domain of METH1 fused to a histidine tag or a GST, 
25 respectively. METH1 recombinant protein was made with two different tags 

because of purification and structural advantages. The differences in size are due 
to the size of the tag, 6KDa the histidine and 27KDa the GST. GST-METH2 
contained the first TSP domain of METH2 also fused to a GST. A fragment 
corresponding to the last two type 1 repeats of TSP1, also fused to a GST, and 



15 



20 



WO 99/37660 



-123- 



PCT/US99/01313 



intact TSP1 purified from platelets were used as positive controls. In addition, 
GST alone was included in all experiments as negative control. 

Example 4: TSP domains in METH1 andMETH2 disrupt angiogenesis in vivo 
Cornea pocket assay 

5 Swiss Webster females and males, were purchased from Charles River 

(Boston, MA) and used between 8-10 weeks-old for implantation of the pellets. 
Cornea pockets were performed as described by Kenyon and colleagues (Kenyon, 
B.M., et al.. Invest. Ophthalmol. Vis. Sci. 37:1625-1632 (1996)) with few 
modifications. Briefly, a solution of lOug of recombinant bFGF plus 5 mg of 

10 sucralfate were mixed with lOul of Hydron (200mg/ml in ethanol; New 

BrunswickTNl) "and" the recombinant protein of interest (2ug). The suspension 
was then smeared onto a sterile nylon mesh square (pore size 500um; Tetko Inc., 
BriarcluTManor, NY) and allowed to dry for 30min. The fibers of the mesh were 
pulled to produce pellets of SOOum 3 that were stored at -20°C. Uniformly sized 

15 pellets were selected under a microscope and used for the assays. 

Mice were anesthetized with Avertin. An incision was made in the cornea 
using a Nikon SMZ-U dissecting microscope with the aid of a surgical blade. A 
single pellet was implanted into the pocket. Five days after pellet implantation, 
corneal angiogenesis was evaluated and photographed. 

20 CAM assay 

Chorioallantoic membrane assays were performed on Leghorn chicken 
embryos (SPAFAS, MA) at 12-14 days of embryonic development. Matrigel 
(750ug/mJ), VEGF (250ng/mesh) and the protein or peptide to be tested were 
mixed, placed onto nylon meshes (pore size 250um; Tetko Inc.) and incubated 

25 sequentially at 37°C for 30min and at 4°C for 2h to induce polymerization. A 

positive (matrigel and VEGF) and a negative (VEGF alone) control were also 
prepared for each CAM. Polymerized meshes were placed onto the third outer 
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region of the CAM and incubated for 24h. To visualize vessels, 400ul of 
fluorescein isothiocyanate dextran (lOmg/ml, SIGMA) was injected in the chick 
blood stream. After 5-10min incubation, the chick was topically fixed with 3 .7% 
formaldehyde for 5min. The meshes were then dissected and mounted onto slides. 
Fluorescence intensity was analyzed with a computer-assisted image program 

(NTH Image 1.59). 

Peptides used on these assays were synthesized by Chiron (Raleigh, NC). 
Sequence corresponded to amino acids: P-TSP1, 430-447; P-METH1, 549-563; 

P-METH2, 529-548. 

The evaluation of angiogenic or anti-angiogenic responses relies heavily 
on the sensitivity and specificity of the assays used to assess the response. To 
evaluate the anti-angiogenic activity of these fragments in vivo, two popular and 
well-accepted angiogenesis assays were used: the corneal pocket-and the 
chorioallantoic membrane. The visibility, accessibility, and avascularity of the 
cornea are highly advantageous and facilitate the visualization of the neovascular 
response and the topical application of the test substances. A known amount of 
angiogenesis factor(s) is implanted, as a pellet, in a pocket made in the cornea eye. 
To test an angiogenesis inhibitor, the molecule is implanted with the stimulator in 
the same pellet, and the response is compared to the stimulator alone. 

In these experiments, bFGF was used as the vascularization stimulator. 
Pellets containing the recombinant protein were implanted in mouse corneas and 
their ability to inhibit the bFGF-induced angiogenic response was compared to that 
of controls. When a bFGF pellet containing GST was implanted new capillary 
vessels grew from the cornea limbus, across the cornea and into the pellet within 
5 days. In contrast, addition of GST-METH1 or GST-METH2 to the bFGF 
pellets completely abolished blood vessel growth. Table 4 contains a summary of 
the results obtained from 41 assays performed. IntactTSPl purified from platelets 
and GST-TSP1 were used as positive controls. All assays were performed at 
identical concentrations, suggesting that METH1 and METH2 have similar 
potency to that of TSP1 in the inhibition of angiogenesis. In addition, when half 
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of the standard concentration was used, a weak, however noticeable response was 
seen, indicating a dose-dependent effect. 



Table 4. 

Activity of METH1 and METH2 recombinant proteins in the corneal 

pocket assay 


bFGF Pellets 


Vascularized corneas/Total corneas 


Vehicle 


5/5 


TSP1 


0/5 


GST 


11/11 


GST-TSP1-T1 


1/4 


GST-METH 1 -TSP 


0/8 


GST-METH2-TSP 


0/8 
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In the CAM assay, the angiogenic response is analyzed by measuring the 
number of vessels that grow within a matrix polymer containing the angiogenic 
growthfactor. To determine whether recombinant METH 1 and METH2 proteins 
inhibited neovascularization in the CAM assay induced by VEGF, a matrigel 
polymer containing VEGF and the recombinant protein were implanted in the 
CAM. Quantitative analysis of the experiments, which included three different 
polymers per treatment are shown in Figure 6A. Matrigels polymers containing 
VEGF plus 5^g of GST-METH 1 or GST-METH2 caused greater than 80% 
inhibition in blood vessel growth. A similar potency was found using the GST 
recombinant protein derived from the type I repeats of TSP1. Furthermore, the 
anti-angiogenic effect of the TSP domains in METH1 and METH2 was dose- 
dependent with a complete inhibition of blood vessel growth when 15 /^g/ml of 
protein was used (Figure 6C and D). GST alone, at identical concentrations, had 
no significant effect on VEGF-stimulated angiogenesis. 
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Synthetic peptides from the second or the third type I repeats of human 
TSP1 can mimic that anti-angiogenic effects of the intact TSP 1 (Tolsma, S.S., et 
al, J. Cell. Biol 122A91-5U (1993)). In fact, a 19-residue polypeptide was 
shown to be sufficient to block in vivo neovascularization in the rat cornea and to 
inhibit the bFGF-induced migration of cultured endothelial cells (Vogel, T., et al, 
J. Cell Biochem, 53:74-84 (1993); Tolsma, S.S., etaL, J. Cell Biol 722:497-51 1 
(1993)). To test whether the same was true for the METH1 and METH2 TSP 
domains, peptides derived from the same region were synthesized and their anti- 
angiogenic activity was evaluated in the CAM assay. The results are shown in 
Figure 6B. Peptides derived from both the TSP domain of METH1 and METH2 
blocked VEGF-induced angiogenesis similarly to that of TSP 1. In contrast, 
scramble peptides had no significant effects. 



Example 5; Proliferation assays 

Human dermal endothelial cells (HDEC) were isolated and grown on 
Vitrogen™ coated petri-dishes in EBM (Clonetics, San Diego, CA) supplemented 
with 15% fetal calf serum, 25ng/ml cAMP, and lug/ml of hydrocortisone-2 1 - 
acetate and were used from passages 3 to 6. Cells were made quiescent by 
incubation of confluent monolayers with phenol red-free EBM containing 0.2% 
BSA for 48h. Human dermal fibroblasts were isolated from neonatal foreskin and 
by enzymatic dissociation. Both fibroblasts and smooth muscle cells were 
maintained in DMEM supplemented with 10% fetal calf serum. Human mammary 
epithelial cells (HMEC) were purchased from Clonetics and maintained in the 
recommended media (mammary epithelial growth media, MEGM). 

Quiescent human dermal endothelial cells, between passage 3 and 6, were 
plated on Vitrogen™ coated 24-well plates in EBM supplemented with 0.2% 
BSA, 0.1% fetal calf serum and 1 ng/ml of bFGF in the presence or absence of the 
recombinant protein and incubated at 5% C0 2 at 37°C for 48h. For vascular 
smooth muscle (VSM) and fibroblast proliferation assays, cells were incubated 
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under the same conditions but using DMEM instead of EBM. Human mammary 
epithelial cells were incubated on their growth media. A pulse of [ 3 H]-Thymidine 
(luCi/nl) was added during the last 4h prior harvesting. Cells were washed and 
fixed in 10% TCA. Incorporation of [ 3 H]-thymidine was determined by 

5 scintillation counting, as previously described (Iruela-Arispe, M.L. & Sage, E.H., 

J. Cell. Biochem. 52:414 (1993)). 

Statistical analysis were done using In- Stat software (Graph Pad Software) 
for Macintosh. Assuming normal distributions, data were analyzed by one-way 
ANOVA, followed by either T-test Dunnett test for comparisons between groups, 

10 or student-Newman-Kleus test for multiple comparisons between groups. 

To gain insight into the mechanism by which METH1 and METH2 inhibit 
neovascularization, the direct effect of the purified recombinant fusion proteins on 
endothelial cell"proliferation was tested. Serum-starved endothelial cells were 
plated into growth medium containing bFGF and FCS. Recombinant proteins 

1 5 (3 jig/ml) were added at the same time of plating. 40% (GST-METH 1 ), 45% (6H- 

GST) or 36% (GST-METH2) inhibition was observed, in contrast to a non- 
significant effect when GST alone was added. The recombinant protein from the 
type I repeats of TSP1 had similar inhibitory effects. (Figure 7A). Furthermore, 
suppression of proliferation mediated by METH1 or METH2 were dose- 

20 dependent, as shown in Figure 7E. The inhibition was observed as early as one 

day after treatment and the inhibitory effect was not toxic and reversible since the 
removal of the recombinant protein and subsequent addition of growth factor 
alone led to the resumption of endothelial cell proliferation. 

The cell specificity of the antiproliferative effects for METH1 and 

25 METH2 on the endothelium was evaluated by additional proliferation assays on 

a variety of non-endothelial cells. No significant inhibition of proliferation was 
seen on fibroblasts or smooth muscle cell cultures. In contrast, a non significant, 
but reproducible stimulation of proliferation for these two cell types could be 
observed. This result rules out the presence of any potential nonspecific inhibitor 

30 of cell growth in the recombinant protein preparations. On mammary epithelial 
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cell, however, METH1 and METH2 inhibited cell proliferation to the same degree 
as to endothelial cells. Interestingly, TSP1 also suppresses mammary epithelial 
cell proliferation both in vitro and in a transgenic model. 

The possibility that METH1 and METH2 might act as disintegrins is 
consistent with their anti-angiogenic properties. Clearly blockade of ccvP3 and P 1 
integrins with antibodies has been shown to inhibit neovascularization both during 
development and in tumors (Brooks, PC, etal, Cell 55:683-693 (1996); Brooks, 
P.C., etal. Cell 92:391-400 (1998); Senger, D.R., etal, Proc. Natl. Acad. Sci. 
USA 94: 13612-13617 (1997)). Integrins are essential for the mediation of both 
proliferative and migratory signals (Schwartz, M.A. & lngber, D.E., Mol. Biol. 
Cell 5:389-393 (1994)), therefore interference with those signals can be highly 
deleterious to the angiogenic process. The angiogenic functional assays were 

— - performed with recombinant protein containing only the.type I repeats inJMETHl 

and METH2. 

The mechanism of action of METH1 and METH2 with regards to their 
angio^inhibitory activity is not known. To date we have evidence that these 
proteins are secreted and bind to endothelial cells. Further investigations are 
guided towards the identification of receptors and signal transduction mechanisms. 
A likely hypothesis resulting from the lessons learned from TSP1 is that both 
METH1 and METH2 bind to CD36. Recently, this scavenger receptor has been 
implicated in the mediation of signals by which TSP-1 exert its anti-angiogenic 
effects (Dawson, D.W., et al., J. Cell. Biol. 735:707-717 (1997)). Both the 
CSVTCG (SEQ ID NO:83) (Asch, AS., et al.. Nature 262:1436-1439 (1993); 
Catimel, B., et al.. Biochem. J. 284:231-236 (1992)) and the GCQXR (SEQ ID 
NO:84) sequences have been proposed as primary binding motifs to CD36 
(Dawson, D.W., etal, J. Cell. Biol. 1 38:101-111 (1997)). METH1 and METH2 
have almost entire conservation in both these regions. A complementary and also 
likely occurrence is binding of METH1 and METH2 to bFGF. Binding to heparin 
and bFGF has been proposed as part of the anti-angiogenic activity of TSP 1 (Guo, 
N., et al., J. Peptide Res. 49 (1997)). This property appears to be mediated 
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through the WSXWS (SEQ ID NO:82) motif, also conserved in METH1 and 
METH2. Future efforts will focus on the signals implicated in the anti- angiogenic 
properties mediated by these novel proteins and on their potential as proteases of 
the extracellular milieu. 

Example 6: Isolation of the METH1 or METH2 cDNA Clone From the 
Deposited Sample 

Two approaches can be used to isolate METH1 or METH2 from the 
deposited sample. First, the deposited clone is transformed into a suitable host 
(such as XL-1 Blue (Stratagene)) using techniques known to those of skill in the 
art, such as those provided by the vector supplier or in related publications or 
patents. The transformants are plated on 1.5% agar plates (containing the 
appropriate selection agent, e.g., ampicillin) to a density of about 150 
transformants (colonies) per plate. A single colony is then used to generate DNA 
using nucleic acid isolation techniques well known to those skilled in the art. (e.g., 
Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edit., (1989), 
Cold Spring Harbor Laboratory Press.) 

Alternatively, two primers of 1 7-20 nucleotides derived from both ends of 
the SEQ ED NO: 1 or SEQ ID NO:3 (i.e., within the region of SEQ ID NO: 1 or 
SEQ ID NO: 3 bounded by the 5' NT and the 3' NT of the clone) are synthesized 
and used to amplify the METH1 or METH2 cDNA using the deposited cDNA 
plasmids as templates. The polymerase chain reaction is carried out under routine 
conditions, for instance, in 25 pi of reaction mixture with 0.5 pg of the above 
cDNA template. A convenient reaction mixture is 1 . 5-5 mM MgCl 2 , 0.01 % (w/v) 
gelatin, 20 uM each of d ATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 
0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94 
degree C for 1 min; annealing at 55 degree C for 1 min; elongation at 72 degree 
C for 1 min) are performed with a Perkin-Elmer Cetus automated thermal cycler. 
The amplified product is analyzed by agarose gel electrophoresis and the DNA 
band with expected molecular weight is excised and purified. The PCR product 
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is verified to be the selected sequence by subcloning and sequencing the DNA 
product. 

Several methods are available for the identification of the 5* or 3' non- 
coding portions of the METH1 or METH2 gene which may not be present in the 
deposited clones. These methods include but are not limited to, filter probing, 
clone enrichment using specific probes, and protocols similar or identical to 5' and 
3' "RACE" protocols which are well known in the art. For instance, a method 
similar to 5' RACE is available for generating the missing 5' end of a desired full- 
length transcript. (Fremont-Racine et aL y Nucleic Acids Res. 2/^:1683-1684 
(1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5* ends of a 
population of RNA presumably containing full-length gene RNA transcripts. A 
primer _set containing a primer specific to the ligated RNA oligonucleotide and a 
primer specific to a known sequence of the METH1 or METH2 gene of interest 
is used to PGR amplify the 5' portion of the METH1 or METH2 fiill-length gene. 
This amplified product may then be sequenced and used to generate the full length 
gene. 

This above method starts with total RNA isolated from the desired source, 
although poly-A+ RNA can be used. The RNA preparation can then be treated 
with phosphatase if necessary to eliminate 5* phosphate groups on degraded or 
damaged RNA which may interfere with the later RNA ligase step. The 
phosphatase should then be inactivated and the RNA treated with tobacco acid 
pyrophosphatase in order to remove the cap structure present at the 5' ends of 
messenger RNAs. This reaction leaves a 5' phosphate group at the 5' end of the 
cap cleaved RNA which can then be ligated to an RNA oligonucleotide using T4 
RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction 
is used as a template for PCR amplification of the desired 5' end using a primer 
specific to the ligated RNA oligonucleotide and a primer specific to the known 
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sequence of the gene of interest. The resultant product is then sequenced and 
analyzed to confirm that the 5* end sequence belongs to the METHl or METH2 
gene. 

Example 7: Bacterial Expression of METHl or METH2 

A METHl or METH2 polynucleotide encoding a METHl or METH2 
polypeptide invention is amplified using PCR oligonucleotide primers 
corresponding to the 5' and 3* ends of the DNA sequence, as outlined in Example 
5, to synthesize insertion fragments. The primers used to amplify the cDN A insert 
should preferably contain restriction sites, such as BamHl and XbaL, at the 5' end 
of the primers in order to clone the amplified product into the expression vector. 
For example, BamHLand Xbal correspond to the restriction enzyme sites-on the 
bacterial expression vector pQE-9. (Qiagen, Inc., Chatsworth, CA). Thisplasmid 
vector encodes antibiotic resistance (Amp'), a bacterial origin of replication (ori), 
an IPTG-regulatable promoter/operator (P/O), a ribosome binding site (RBS), a 
6-histidine tag (6-His), and restriction enzyme cloning sites. The pQE-9 

vector is digested with BamHl and Xbal and the amplified fragment is ligated into 
the pQE-9 vector maintaining the reading frame initiated at the bacterial RBS. 
The ligation mixture is then used to transform the E. coli strain Ml 5/rep4 (Qiagen, 
Inc.) which contains multiple copies of the plasmid pREP4, which expresses the 
lad repressor and also confers kanamycin resistance (Kan 1 ). Transformants are 
identified by their ability to grow on LB plates and ampicillin/kanamycin resistant 
colonies are selected. Plasmid DNA is isolated and confirmed by restriction 
analysis. 

Clones containing the desired constructs are grown overnight (O/N) in 
liquid culture in LB media supplemented with both Amp ( 1 00 ug/ml) and Kan (25 
ug/ml). The O/N culture is used to inoculate a large culture at a ratio of 1 : 1 00 to 
1 :250. The cells are grown to an optical density 600 (O.D. 600 ) of between 0.4 and 
0.6. IPTG (Isopropyl-B-D-thiogalacto pyranoside) is then added to a final 
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concentration of 1 mM. IPTG induces by inactivating the IacI repressor, clearing 
the P/O leading to increased gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic 
agent 6 Molar Guanidine HC1 by stirring for 3-4 hours at 4 degree C. The cell 
debris is removed by centrifugation, and the supernatant containing the 
polypeptide is loaded onto anickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin 
column (available from Q1AGEN, Inc., supra). Proteins with a 6 x His tag bind 
to the Ni-NTA resin with high affinity and can be purified in a simple one-step 
procedure (for details see: The QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl," 
pH 8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, 
ti^JL w ^k^ Y^l* 1 10 volumes of 6 M gu^dine-HCl pH 6, and finally the 
polypeptide is eluted with 6 M guanidine-HCl, pH 5. 

The purified METH1 or METH2 protein is then renatured by dialyzing it 
against phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 
200 mMNaCl. Alternatively, the METH1 or METH2 protein can be successfully 
refolded while immobilized on the Ni-NTA column. The recommended conditions 
are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCl, 
20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors. The 
renaturation should be performed over a period of 1.5 hours or more. After 
renaturation the proteins are eluted by the addition of 250 mM immidazole. 
Immidazole is removed by a final dialyzing step against PBS or 50 mM sodium 
acetate pH 6 buffer plus 200 mM NaCl. The purified METH 1 or METH2 protein 
is stored at 4°C or frozen at -80 °C. 

In addition to the above expression vector, the present invention further 
includes an expression vector comprising phage operator and promoter elements 
operatively linked to a METH 1 or METH2 polynucleotide, called pHE4a. (ATCC 
Accession Number 209645, deposited February 25, 1 998.) This vector contains: 
1) a neomycinphosphotransferase gene as a selection marker, 2) aniT. coli origin 
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of replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 
5) a Shine-Delgarno sequence, and 6) the lactose operon repressor gene (laclq). 
The origin of replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). 
The promoter sequence and operator sequences are made synthetically. 

DN A can be inserted into the pHEa by restricting the vector with Ndel and 
Xbal, BamHI, Xhol, or Asp718, running the restricted product on a gel, and 
isolating the larger fragment (the stuffer fragment should be about 310 base pairs). 
The DN A insert is generated according to the PCR protocol described in Example 
5, using PCR primers having restriction sites for Ndel (5' primer) and Xbal, 
BamHI, Xhol, or Asp71 8 (3 1 primer). The PCR insert is gel purified and restricted 
with compatible enzymes. The insert and vector are ligated according to standard 
protocols. 

The engineered vector could easily be substituted in the above protocol to 
express protein in a bacterial system. 

Example 8: Purification ofMETHl orMETH2 Polypeptide from an Inclusion 
Body 

The following alternative method can be used to purify METH1 or 
METH2 polypeptide expressed in Ecoli when it is present in the form of inclusion 
bodies. Unless otherwise specified, all of the following steps are conducted at 
4-10°C 

Upon completion of the production phase of the E. coli fermentation, the 
cell culture is cooled to 4-10 degree C and the cells harvested by continuous 
centrifiigation at 15,000 rpm (Heraeus Sepatech). On the basis of the expected 
yield of protein per unit weight of cell paste and the amount of purified protein 
required, an appropriate amount of cell paste, by weight, is suspended in a buffer 
solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed 
to a homogeneous suspension using a high shear mixer. 

The cells are then lysed by. passing the solution through a microfluidizer 
(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The 
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homogenate is then mixed with NaCl solution to a final concentration of 0.5 M 
NaCl, followed by centrifugation at 7000 xg for 15 rnin. The resultant pellet is 
washed again using 0.5M NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., 
the pellet is discarded and the polypeptide containing supernatant is incubated at 
4 degree C overnight to allow further GuHCI extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble 
particles, the GuHCI solubilized protein is refolded by quickly mixing the GuHCl 
extract with 20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM 
NaCl, 2 mM EDTA by vigorous stirring. The refolded diluted protein solution is 
kept at 4 degree C without mixing for 12 hours prior to further purification steps. 

To clarify the refolded polypeptide "solution, a previously prepared 
tangential filtration unit equipped with 0. 1 6 um membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros 
HS-50, Perseptive Biosystems). The column is washed with 40 mM sodium 
acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1 000 mM, and 1 500 mM NaCl 
in the same buffer, in a stepwise manner. The absorbance at 280 nm of the 
effluent is continuously monitored. Fractions are collected and further analyzed 
by SDS-PAGE. 

Fractions containing the METH1 or METH2 polypeptide are then pooled 
and mixed with 4 volumes of water. The diluted sample is then loaded onto a 
previously prepared set of tandem columns of strong anion (Poros HQ-50, 
Perseptive Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) 
exchange resins. The columns are equilibrated with 40 mM sodium acetate, pH 
6.0. Both columns are washed with 40 mM sodium acetate,* pH 6.0, 200 mM 
NaCl. The CM-20 column is then eluted using a 1 0 column volume linear gradient 
ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1 .0 M NaCl, 50 mM 
sodium acetate, pH 6.5. Fractions are collected under constant A 280 monitoring 
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of the effluent. Fractions containing the polypeptide (determined, for instance, by 
16% SDS-PAGE) are then pooled. 

The resultant METH1 or METH2 polypeptide should exhibit greater than 
95% purity after the above refolding and purification steps. No major contaminant 
bands should be observed from Coomassie blue stained 1 6% SDS-P AGE gel when 
5 ug of purified protein is loaded. The purified METH1 or METH2 protein can 
also be tested for endotoxin/LPS contamination, and typically the LPS content is 
less than 0. 1 ng/ml according to LAL assays. 

Example 9: Cloning and Expression of METH1 or METH2 in a Baculovirus 
Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert METH1 
or METH2 p^lynucleotidFTnto a baculovirus to express METH1 or METH2. 
This expression vector contains the strong polyhedrin promoter o f the Autographa 
californica nuclear polyhedrosis virus (AcMNPV) followed by convenient 
restriction sites such as BamHI, Xba I and Asp718. The polyadenylation site of 
the simian virus 40 ("SV40") is used for efficient polyadenylation. For easy 
selection of recombinant virus, the plasmid contains the beta-galactosidase gene 
from E. coli under control of a weak Drosophila promoter in the same orientation, 
followed by the polyadenylation signal of the polyhedrin gene. The inserted genes 
are flanked on both sides by viral sequences for cell-mediated homologous 
recombination with wild-type viral DN A to generate a viable virus that express the 
cloned METH1 or METH2 polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, 
such as pAc373, pVL941, and pAcIMl, as one skilled in the art would readily 
appreciate, as long as the construct provides appropriately located signals for 
transcription, translation, secretion and the like, including a signal peptide and an 
in-frame AUG as required. Such vectors are described, for instance, in Luckow 
et ah, Virology 170:31-39 (1989). 
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Specifically, the METH1 or METH2 cDNA sequence contained in the 
deposited clone, including the AUG initiation codon and any naturally associated 
leader sequence, is amplified using the PCR protocol described in Example 5. If 
the naturally occurring signal sequence is used to produce the secreted protein, the 
pA2 vector does not need a second signal peptide. Alternatively, the vector can 
be modified (pA2 GP) to include a baculovirus leader sequence, using the 
standard methods described in Summers et al., "A Manual of Methods for 
Baculovirus Vectors and Insect Cell Culture Procedures," Texas Agricultural 
Experimental Station Bulletin No. 1555 (1987). 

The amplified fragment is isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The 
fragment then is digested with appropriate restriction enzymes and again purified 
on a 1% agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using 
routine procedures known in the art. The DNA is then isolated from a 1% 
agarose gel using a commercially available kit ("Geneclean" BIO 101 Inc., La 
Jolla, Ca.). ✓ 

The fragment and the dephosphorylated plasmid are ligated together with 
T4 DNA ligase. E. coli HB 1 0 1 or other suitable E. coli hosts such as XL- 1 Blue 
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation 
mixture and spread on culture plates. Bacteria containing the plasmid are 
identified by digesting DNA from individual colonies and analyzing the digestion 
product by gel electrophoresis. The sequence of the cloned fragment is confirmed 
by DNA sequencing. 

Five ug of a plasmid containing the polynucleotide is co-transfected with 
1.0 ug of a commercially available linearized baculovirus DNA ("BaculoGold 3 
baculovirus DNA", Pharmingen, San Diego, CA), using the lipofection method 
described by Feigner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). 
One ug of BaculoGold 3 virus DNA and 5 ug of the plasmid are mixed in a sterile 
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well of a microtiter plate containing 50 ul of serum-free Grace's medium (Life 
Technologies Inc., Gaithersburg, MD). Afterwards, 10 ul Lipofectin plus 90 ul 
Grace's medium are added, mixed and incubated for 15 minutes at room 
temperature. Then the transfection mixture is added drop-wise to Sf9 insect cells 
(ATCC CRL 1711) seeded in a 35 mm tissue culture plate with 1 ml Grace's 
medium without serum. The plate is then incubated for 5 hours at 27 degrees C. 
The transfection solution is then removed' from the plate and 1 ml of Grace's 
insect medium supplemented with 10% fetal calf serum is added. Cultivation is 
then continued at 27 degrees C for four days. 

After four days the supernatant is collected and a plaque assay is 
performed, as described by Summers and Smith, supra. An agarose gel with 
"Blue Gal" (Life Technologies Inc., Gaithersburg) is used to allow easy 
- identification and isolation of gal-expressing clones, which produce blue-stained 
plaques. (A detailed description of a "plaque assay" of this type can also be found 
in the user's guide for insect cell culture and baculovirology distributed by Life 
Technologies Inc., Gaithersburg, page 9-10.) After appropriate incubation, blue 
stained plaques are picked with the tip of a micropipettor (e.g., Eppendorf). The 
agar containing the recombinant viruses is then resuspended in a microcentrifuge 
tube containing 200 ul of Grace's medium and the suspension containing the 
recombinant baculovirus is used to infect Sf9 cells seeded in 35 mm dishes. Four 
days later the supematants of these culture dishes are harvested and then they are 
stored at 4 degree C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's 
medium supplemented with 1 0% heat-inactivated FB S . The cells are infected with 
the recombinant baculovirus containing the polynucleotide at a multiplicity of 
infection ("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later 
the medium is removed and is replaced with SF900 II medium minus methionine 
and cysteine (available from Life Technologies Inc., Rockville, MD). After 42 
hours, 5 uCi of 35 S -methionine and 5 uCi 35 S-cysteine (available from Amersham) 
are added. The cells are further incubated for 1 6 hours and then are harvested by 
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centrifugation. The proteins in the supernatant as well as the intracellular proteins 
are analyzed by SDS-PAGE followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of 
purified protein may be used to determine the amino terminal sequence of the 
produced METH1 or METH2 protein. 

Example 10: Expression ofMETHl or METH2 in Mammalian Cells 

METH1 or METH2 polypeptide can be expressed in a mammalian cell. A 
typical mammalian expression vector contains a promoter element, which mediates 
the initiation of transcription of mRNA, a protein coding sequence, and signals 
required for the termination of transcription and polyadenylation of the transcript. 
Additional elements include enhancers, Kozak sequences and intervening 
sequences flanked by donor and acceptor sites for KNA splicing. Highly efficient 
transcription is achieved with the early and late promoters from SV40, the long 
terminal repeats (LTRs) from Retroviruses, e.g., RSV, HTLVI, HI VI and the 
early promoter of the cytomegalovirus (CMV). However, cellular elements can 
also be used (e.g., the human actin promoter). 

Suitable expression vectors for use in practicing the present invention 
include, for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, 
Sweden), pRSVcat (ATCC 37152), pSV2DHFR (ATCC 37146), pBC12MI 
(ATCC 67109), pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells 
that could be used include, human Hela, 293, H9 and Jurkat cells, mouse NIH3T3 
and C127 cells, Cos 1, Cos 7 and CV1, quail QC1-3 cells, mouse L cells and 
Chinese hamster ovary (CHO) cells. 

Alternatively, METH1 orMETH2 polypeptide can be expressed in stable 
cell lines containing the METH1 or METH2 polynucleotide integrated into a 
chromosome. The co-transfection with a selectable marker such as DHFR, gpt, 
neomycin, hygromycin allows the identification and isolation of the transfected 
cells. 
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The transfected METH1 or METH2 gene can also be amplified to express 
large amounts of the encoded protein. The DHFR (dihydrofolate reductase) 
marker is useful in developing cell lines that carry several hundred or even several 
thousand copies of the gene of interest. (See, e.g., Alt, F. W. } et al y J. Biol 
5 Chem. 253:1357-1370 (1978); Hamlin, J. L. and Ma, C, Biochem. et Biophys. 

Acta 7097:107-143 (1990); Page, M. J. and Sydenham, M. A., Biotechnology 
9:64-68 (1991).) Another useful selection marker is the enzyme glutamine 
synthase (GS) (Murphy et al , Biochem J, 22 7:277-279 (1 99 1); Bebbington et al , 
Bio/Technology 70:169-175 (1992). Using these markers, the mammalian cells 

10 are grown in selective medium and the cells with the highest resistance are 

selected. These cell lines contain the amplified gene(s) integrated into a" 
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-DHFR (ATCC Accession No. 37146), 

15 the expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC 

Accession No. 209647) contain the strong promoter (LTR) of the Rous Sarcoma 
Virus (Cullen et al, Molecular and Cellular Biology, 438-447 (March, 1 985)) 
plus a fragment of the CMV-enhancer (Boshart et al, Cell 47:521-530 (1985).) 
Multiple cloning sites, e.g., with the restriction enzyme cleavage sites BamHI, 

20 Xbal and Asp7 1 8, facilitate the cloning of METH1 or METH2. The vectors also 

contain the 3* intron, the polyadenylation and termination signal of the rat 
preproinsulin gene, and the mouse DHFR gene under control of the SV40 early 
promoter. 

If a naturally occurring signal sequence is used to produce a secreted 
25 protein, the vector does not need a second signal peptide. Alternatively, if a 

naturally occurring signal sequence is not used, the vector can be modified to 
include a heterologous signal sequence in an effort to secrete the protein from the 
cell. (See, e.g., WO 96/34891.) 

The amplified fragment is then digested with the appropriate restriction 
30 enzyme and purified on a 1% agarose gel using a commercially available kit 



WO 99/37660 



-140- 



PCT/US99/01313 



("Geneclean," BIO 101 Inc., La Jolla, Ca.). The isolated fragment and the 
dephosphorylated vector are then ligated with T4 DNA ligase. E. coli HB 1 0 1 or 
XL-1 Blue cells are then transformed and bacteria are identified that contain the 
fragment inserted into plasmid pC6 or pC4 using, for instance, restriction enzyme 
analysis. 

Chinese hamster ovary cells lacking an active DHFR gene is used for 
transfection. Five \i% of the expression plasmid pC6 or pC4 is cotransfected with 
0.5 ug of the plasmid pS Vneo using lipofectin (Feigner et al., supra). The plasmid 
pS V2-neo contains a dominant selectable marker, the neo gene from Tn5 encoding 
an enzyme that confers resistance to a group of antibiotics including G4 1 8. The 
cells are seeded in alpha minus MEM supplemented with 1 mg/ml G41 8. After 2 
days, the cells are trypsinized and seeded in hybridoma cloning plates (Greiner, 
Germany) in alpha minus MEM supplemented with 10, 25, or 50 ng/ml of 
metothrexate plus 1 mg/ml G418. After about 10-14 days single clones are 
trypsinized and then seeded in 6-well petri dishes or 10 ml flasks using different 
concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). 
Clones growing at the highest concentrations of methotrexate are then transferred 
to new 6-well plates containing even higher concentrations of methotrexate ( 1 uM, 
2 uM, 5 uM, 10 mM, 20 mM). The same procedure is repeated until clones are 
obtained which grow at a concentration of 1 00 - 200 uM. Expression of METH1 
or METH2 is analyzed, for instance, by SDS-PAGE and Western blot or by 
reversed phase HPLC analysis. 

Example 11: Construction of N-Terminal and/or C-Terminal Deletion 
Mutants 

The following general approach may be used to clone a N-terminal or C- 
terminal deletion METH1 or METH2 deletion mutant. Generally, two 
oligonucleotide primers of about 1 5-25 nucleotides are derived from the desired 
5' and 3' positions of a polynucleotide of SEQ ID NO:l or SEQ ID NO:3. The 
5' and 3' positions of the primers are determined based on the desired METH1 or 
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METH2 polynucleotide fragment. An initiation and stop codon are added to the 
5' and 3' primers respectively, if necessary, to express the METH1 or METH2 
polypeptide fragment encoded by the polynucleotide fragment. Preferred METH 1 
or METH2 polynucleotide fragments are those encoding the N-terminal and C- 
terminal deletion mutants disclosed above in the "Polynucleotide and Polypeptide 
Fragments" section of the Specification. 

Additional nucleotides containing restriction sites to facilitate cloning of 
the METH1 or METH2 polynucleotide fragment in a desired vector may also be 
added to the 5* and 3' primer sequences. The METH1 or METH2 polynucleotide 
fragment is amplified from genomic DNA or from the deposited cDNA clone 
using the appropriate PCR oligonucleotide primers and conditions discussed 
herein or known in the art. The METH1 or METH2 polypeptide fragments 
encoded by the METH1 or METH2 polynucleotide fragments of the present 
invention may be expressed and purified in the same general manner as the full 
length polypeptides, although routine modifications may be necessary due to the 
differences in chemical and physical properties between a particular fragment and 
full length polypeptide. 

As a means of exemplifying but not limiting the present invention, the 
polynucleotide encoding the METH1 polypeptide fragment D-40 to S-950 or the 
METH2 polypeptide fragment L-20 to L-890 is amplified and cloned as follows: 
A 5* primer is generated comprising a restriction enzyme site followed by an 
initiation codon in frame with the polynucleotide sequence encoding the N- 
terminal portion of the polypeptide fragment beginning with D-40 or L-20, 
respectively. A complementary 3 1 primer is generated comprising a restriction 
enzyme site followed by a stop codon in frame with the polynucleotide sequence 
encoding C-terminal portion of the METH1 or METH2 polypeptide fragment 
ending with S-950 or L-890, respectively. , 

The amplified polynucleotide fragment and the expression vector are 
digested with restriction enzymes which recognize the sites in the primers. The 
digested polynucleotides are then ligated together. The METH1 or METH2 
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polynucleotide fragment is inserted into the restricted expression vector, 
preferably in a manner which places the METH 1 or METH2 polypeptide fragment 
coding region downstream from the promoter. The ligation mixture is 
transformed into competent E. coli cells using standard procedures and as 
described in the Examples herein. Plasmid DNA is isolated from resistant colonies 
and the identity of the cloned DNA confirmed by restriction analysis, PCR and 
DNA sequencing. 

Example 12: Protein Fusions ofMETHl or METH2 

METH1 or METH2 polypeptides are preferably fused to other proteins. 
These fusion proteins can be used for a variety of applications. For example, 
fusion of METH 1 or METH2 polypeptides to His-tag, HA-tag, protein A, IgG 
domains, and maltose binding protein facilitates purification. (See Example 7; see 
also EP A 394,827; Traunecker, et al. 9 Nature 357:84-86 (1988).) Similarly, 
fusion to IgG- 1, IgG-3, and albumin increases the halflife time in vivo. Nuclear 
localization signals fused to METH1 or METH2 polypeptides can target the 
protein to a specific subcellular localization, while covalent heterodimer or 
homodimers can increase or decrease the activity of a fusion protein. Fusion 
proteins can also create chimeric molecules having more than one function. 
Finally, fusion proteins can increase solubility and/or stability of the fused protein 
compared to the non-fused protein. All of the types of fusion proteins described 
above can be made by modifying the following protocol, which outlines the fusion 
of a polypeptide to an IgG molecule, or the protocol described in Example 7. 

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, 
using primers that span the 5' and 3 1 ends of the sequence described below. These 
primers also should have convenient restriction enzyme sites that will facilitate 
cloning into an expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc 
portion can be ligated into the BamHI cloning site. Note that the 3' BamHI site 
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should be destroyed. Next, the vector containing the human Fc portion is re- 
restricted with BamHI, linearizing the vector, and METH1 or METH2 
polynucleotide, isolated by the PCR protocol described in Example 5, is ligated 
into this BamHI site. Note that the polynucleotide is cloned without a stop codon, 
otherwise a fusion protein will not be produced. 

If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 

Human IgG Fc region: 

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACC 
GTGCCCAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCC 
CCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACTCCTGAGGT 
CACATGCGTGGTGGTGGACGTAAGCCACGAAGACCCTGAGGTCAAG 
TTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAA 
AGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGT 
CCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAG 
TGCAAGGTCTCCAACAAAGCCCTCCCAACCCCCATCGAGAAAACCA 
TCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCT 
GCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACC 
TGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGTGGAGTGGG 
AGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGT 
GCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGG 
ACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGAT 
GCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTG 
TCTCCGGGTAAATGAGTGCGACGGCCGCGACTCTAGAGGAT (SEQ ID 
NO:85) 
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Example 13: Production of an Antibody 

The antibodies of the present invention can be prepared by a variety of 
methods. (See, Current Protocols, Chapter 2.) For example, cells expressing 
METH1 or METH2 is administered to an animal to induce the production of sera 
containing polyclonal antibodies. In a preferred method, a preparation of METH 1 
or METH2 protein is prepared and purified to render it substantially free of natural 
contaminants. Such a preparation is then introduced into an animal in order to 
produce polyclonal antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al y Nature 

256:495-( 1975); -Kohler et al , Eur. J. Immunol (J:-5-l 1 (1 976); Kohler et al , Eur.~~ - 

J. Immunol 5:292 (1 976); Hammerling et al , in: Monoclonal Antibodies and T- 
Cell Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such 
procedures involve immunizing an animal (preferably a mouse) with METH1 or 
METH2 polypeptide or, more preferably, with a secreted METH1 or METH2 
polypeptide-expressing cell. Such cells may be cultured in any suitable tissue 
culture medium; however, it is preferable to culture cells in Earle's modified 
Eagle's medium supplemented with 1 0% fetal bovine serum (inactivated at about 
56 degree C), and supplemented with about 10 g/1 of nonessential amino acids, 
about 1,000 U/ml of penicillin, and about 100 ug/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable 
myeloma cell line. Any suitable myeloma cell line may be employed in accordance 
with the present invention; however, it is preferable to employ the parent myeloma 
cell line (SP20), available from the ATCC. After fusion, the resulting hybridoma 
cells are selectively maintained in HAT medium, and then cloned by limiting 
dilution as described by Wands et al {Gastroenterology 80:225-232 (1981).) The 
hybridoma cells obtained through such a selection are then assayed to identify 
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clones which secrete antibodies capable of binding the METH1 or METH2 
polypeptide. 

Alternatively, additional antibodies capable of binding to METH1 or 
METH2 polypeptide can be produced in a two-step procedure using anti-idiotypic 
antibodies. Such a method makes use of the fact that antibodies are themselves 
antigens, and therefore, it is possible to obtain an antibody which binds to a second 
antibody. In accordance with this method, protein specific antibodies are used to 
immunize an animal, preferably a mouse. The splenocytes of such an animal are 
then used to produce hybridoma cells, and the hybridoma cells are screened to 
identify clones which produce an antibody whose ability to bind to theMETHl or 
METH2 protein-specific antibody can be blocked by METH1 or METH2. Such 
antibodies comprise anti-idiotypic antibodies to the METH1 or METH2 protein- 
specific antibody and can bcTused "to immunize an animal to inducefforriiation of 
further METH1 or METH2 protein- specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the 
antibodies of the present invention may be used according to the methods 
disclosed herein. Such fragments are typically produced by proteolytic cleavage, 
using enzymes such as papain (to produce Fab fragments) or pepsin (to produce 
F(ab')2 fragments). Alternatively, secreted METH1 or METH2 protein-binding 
fragments can be produced through the application of recombinant DNA 
technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced 
using genetic constructs derived from hybridoma cells producing the monoclonal 
antibodies described above. Methods for producing chimeric antibodies are 
known in the art. (See, for review, Morrison, Science 229: 1 202 ( 1 985); Oi el al , 
BioTechniques 4:2 1 4 (1 986); Cabilly etal, U.S. Patent No. 4,8 1 6,567; Taniguchi 
et al, EP 171496; Morrison et al, EP 173494; Neuberger et al, WO 8601533; 
Robinson et al, WO 8702671; Boulianne et al, Nature 312:643 (1984); 
Neuberger et al, Nature 314.26% (1985).) 
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Example 14: Production Of METH1 or METH2 Protein For High- 
Throughput Screening Assays 

The following protocol produces a supernatant containing METH1 or 
METH2 polypeptide to be tested. This supernatant can then be used in the 
Screening Assays described in Examples 16-23. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(lmg/ml in PBS) 1 :20 in PBS (w/o calcium or magnesium 1 7-5 1 6F Biowhittaker) 
for a working solution of 50ug/ml. Add 200 ul of this solution to each well (24 
well plates) and incubate at RT for 20 minutes. Be sure to distribute the solution 
over each well (note: a 12-channel pipetter may be used with tips on every other 
channel). Aspirate off the Poly-D-Lysine solution and rinse with 1ml PBS 
(Phosphate Buffered Saline). The PBS should remain in the well until just prior 
to pIating-.the.ceUs and plates may be poly-lysine coated -in advance for up to two 
weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 1 0 s cells/well in .5ml 
DMEM(Dulbecco's Modified Eagle Medium)(with 4.5 G/L glucose and L- 
glutamine (12-604F Biowhittaker))/! 0% heat inactivated FBS(14-503F 
Biowhittaker)/! x Penstrep( 1 7-602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well 
plate. With a small volume multi-channel pipetter, aliquot approximately 2ug of 
an expression vector containing a polynucleotide insert, produced by the methods 
described in Examples 10-12, into an appropriately labeled 96- well round bottom 
plate. With a multi-channel pipetter, add 50ul of the Lipofectamine/Optimem I 
mixture to each well. Pipette up and down gently to mix. Incubate at RT 1 5-45 
minutes. After about 20 minutes, use a multi-channel pipetter to add 150ul 
Optimem I to each well. As a control, one plate of vector DNA lacking an insert 
should be transfected with each set of transfections. 

Preferably, the transfection should be performed by tag-teaming the 
following tasks. By tag-teaming, hands on time is cut in half, and the cells do not 
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spend too much time on PBS. First, person A aspirates off the media from four 
24-weIl plates of cells, and then person B rinses each well with .5- 1ml PBS. 
Person A then aspirates off PBS rinse, and person B, using al2-channel pipetter 
with tips on every other channel, adds the 200ul of DN A/Lipofectamine/Optimem 
I complex to the odd wells first, then to the even wells, to each row on the 24-well 
plates. Incubate at 37 degree C for 6 hours. 

While cells are incubating, prepare appropriate media, either 1%BSA in 
DMEM with lx penstrep, or HGS CHO-5 media (1 16.6 mg/L of CaCl 2 (anhyd); 
0.00130 mg/L CuS0 4 -5H 2 0; 0.050 mg/L of Fe(N0 3 ) 3 -9H 2 0; 0.417 mg/L of 
FeS0 4 -7H 2 0; 3 1 1 .80 mg/L of Kcl; 28.64 mg/L of MgCl 2 ; 48.84 mg/L of MgS0 4 ; 
6995.50 mg/L of NaCl; 2400.0 mg/L of NaHC0 3 ; 62.50 mg/L of NaH 2 PO 4 -H 2 0; 
71.02 mg/L of NaJHP0 4 ; .4320 mg/L of ZnS0 4 -7H 2 0; .002 mg/L of Arachidonic 
Acid; 1.022 mg/L of Cholesterol; .070 mg/L of DL-alpha-Tocopherol-Acetate; 
0.0520 mg/L of Linoleic Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of 
Myristic Acid; 0.010 mg/L of Oleic Acid; 0.010 mg/L of Palmitric Acid; 0.010 
mg/L of Palmitic Acid; 100 mg/L of Pluronic F-68; 0.010 mg/L of Stearic Acid; 
2.20 mg/L of Tween 80; 4551 mg/L of D-Glucose; 130.85 mg/ml of L- Alanine; 
147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml of L-Asparagine-H 2 0; 6.65 mg/ml 
of L-Aspartic Acid; 29.56 mg/ml of L-Cystine-2HCL-H 2 0; 31.29 mg/ml of L- 
Cystine-2HCL; 7.35 mg/ml of L-Glutamic Acid; 365.0 mg/ml of L-Glutamine; 
18.75 mg/ml of Glycine; 52.48 mg/ml of L-Histidine-HCL-H 2 0; 106.97 mg/ml of 
L-Isoleucine; 1 1 1 .45 mg/ml of L-Leucine; 163.75 mg/ml of L-Lysine HCL; 32.34 
mg/ml of L-Methionine; 68.48 mg/ml of L-Phenylalainine; 40.0 mg/ml of L- 
Proline; 26.25 mg/ml of L-Serine; 1 01 .05 mg/ml of L-Threonine; 19.22 mg/ml of 
L-Tryptophan; 91.79 mg/ml of L-Tryrosine-2Na-2H 2 0; and 99.65 mg/ml of L- 
Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 1 1 .78 mg/L of 
Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Ipositol; 3.02 mg/L 
of Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 
0.319 mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of 
Thymidine; 0.680 mg/L of Vitamin B 12 ; 25 mM of HEPES Buffer; 2.39 mg/L of 
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Na Hypoxanthine; 0. 1 05 mg/L of Lipoic Acid ; 0.081 mg/L of Sodium Putrescine- 
2HCL; 55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20uM 
of Ethanolamine; 0.122 mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B- 
Cyclodextrin complexed with Linoleic Acid; 33.33 mg/L of Methyl-B- 
Cyclodextrin complexed with Oleic Acid; 1 0 mg/L of Methyl-B-Cyclodextrin 
complexed with Retinal Acetate. Adjust osmolarity to 327 mOsm) with 2mm 
glutamine and Ix penstrep. (BSA (81-068-3 Bayer) lOOgm dissolved in 1L 
DMEM for a 10% BSA stock solution). Filter the media and collect 50 ul for 
endotoxin assay in 15ml polystyrene conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the 
end of the incubation period. Person A aspirates off the transfection media, while 
person B adds 1 .5ml appropriate media to each well. Incubate at 37 degree C for 
45 or 72 hours depending on the media used: 1 %BSA for 45 hours or CHO-5 
for 72 "hours."" 

On day four, using a 300ul multichannel pipetter, aliquot 600ul in one 1 ml 
deep well plate and the remaining supernatant into a 2ml deep well. The 
supernatants from each well can then be used in the assays described in Examples 
16-23. 

It is specifically understood that when activity is obtained in any of the 
assays described below using a supernatant, the activity originates from either the 
METH1 or METH2 polypeptide directly (e.g., as a secreted protein) or by 
METH1 or METH2 inducing expression of other proteins, which are then 
secreted into the supernatant. Thus, the invention further provides a method of 
identifying the protein in the supernatant characterized by an activity in a particular 
assay. 

Example IS: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and 
proliferation of cells is called the Jaks-STATs pathway. Activated proteins in the 
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Jaks-STATs pathway bind to gamma activation site "GAS" elements or interferon- 
sensitive- responsive element ("ISRE"), located in the promoter of many genes. 
The binding of a protein to these elements alter the expression of the associated 
gene. 

GAS and ISRE elements are recognized by a class of transcription factors 
called Signal Transducers and Activators of Transcription, or "STATs. " There are 
six members of the STATs family. Statl and Stat3 are present in many cell types, 
as is Stat2 (as response to EFN-alpha is widespread). Stat4 is more restricted and 
is not in many cell types though it has been found in T helper class I, cells after 
treatment with IL-1 2. StatS was originally called mammary growth factor, but has 
been found at higher concentrations in other cells including myeloid cells. It can 
be activated in tissue culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus 

upon tyrosine phosphorylation by a set of kinases known as the Janus Kinase 
("Jaks") family. Jaks represent a distinct family of soluble tyrosine kinases and 
include Tyk2, Jakl, Jak2, and Jak3. These kinases display significant sequence 
similarity and are generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the 
Table below. (Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 
64:621-51 (1995).) A cytokine receptor family, capable of activating Jaks, is 
divided into two groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, BL-6, 
EL-7, IL-9, IL-1 1, EL-12, IL-1 5, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, 
and thrombopoietin; and (b) Class 2 includes IFN-a, IFN-g, and IL-10. The Class 
1 receptors share a conserved cysteine motif (a set of four conserved cysteines and 
one tryptophan) and a WSXWS motif (a membrane proxial region encoding Trp- 
Ser-Xxx-Trp-Ser (SEQ ED NO:82)). 

Thus, on binding of a iigand to a receptor, Jaks are activated, which in turn 
activate STATs, which then translocate and bind to GAS elements. This entire 
process is encompassed in the Jaks-STATs signal transduction pathway. 
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Therefore, activation of the Jaks-STATs pathway, reflected by the binding 
of the GAS or the 1SRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and 
cytokines are known to activate the Jaks-STATs pathway. (See Table below.) 
Thus, by using GAS elements linked to reporter molecules, activators of the Jaks- 
STATs pathway can be identified. 
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To construct a synthetic GAS containing promoter element, which is used 
in the Biological Assays described in Examples 16-17, a PCR based strategy is 
employed to generate a GAS-SV40 promoter sequence. The 5* primer contains 
four tandem copies of the GAS binding site found in the IRP1 promoter and 
previously demonstrated to bind STATs upon induction with a range of cytokines 
(Rothman et al y Immunity 7:457-468 (1994).), although other GAS or ISRE 
elements can be used instead. The 5' primer also contains 1 8bp of sequence 
complementary to the S V40 early promoter sequence and is flanked with an Xhol 
site. The sequence of the 5' primer is: 

5':GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTT 
CCCCGAAATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3 , (SEQID 
~ NOT86) 

The downstream primer is complementary to the SV40 promoter and is 
flanked with a Hind III site: 5 1 : GCGGC AAGCTTTTTGC AAAGCCT AGGC : 3 ' 
(SEQ ID NO:87) 

PCR amplification is performed using the S V40 promoter template present 
in the B-gal: promoter plasmid obtained from Clontech. The resulting PCR 
fragment is digested with Xhol/Hind III and subcloned into BLSK2- 
(Stratagene.) Sequencing with forward and reverse primers confirms that the 
insert contains the following sequence: 

5 ! : CTCGAG ATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCC 
GAAATGATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCA 
TAGTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGT 
TCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTATTTATGCA 
GAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAG 
GAGGCTTTTTTGGAGGCCT AGGCTTTTGC AAA AAGCTT : 3 ' (SEQ ID 
NO:88) 

With this GAS promoter element linked to the S V40 promoter, a 
GAS:SEAP2 reporter construct is next engineered. Here, the reporter molecule 
is a secreted alkaline phosphatase, or "SEAP." Clearly, however, any reporter 
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molecule can be instead of SEAP, in this or in any of the other Examples. Well 
known reporter molecules that can be used instead of SEAP include 
chloramphenicol acetyltransferase (CAT), luciferase, alkaline phosphatase, B- 
galactosidase, green fluorescent protein (GFP), or any protein detectable by an 
antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is 
subcloned into the pSEAP-Promoter vector obtained from Clontech using Hindlll 
and Xhol, effectively replacing the SV40 promoter with the amplified GAS:SV40 
promoter element, to create the GAS-SEAP vector. However, this vector does 
not contain a neomycin resistance gene, and therefore, is not preferred for 
mammalian expression systems. 

Thus; in order to generate mammalian stable cell lines expressing the GAS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector 
using Sail and Notl, and inserted into a backbone vector containing the neomycin 
resistance gene, such as pGFP-1 (Clontech), using these restriction sites in the 
multiple cloning site, to create the GAS-SEAP/Neo vector. Once this vector is 
transfected into mammalian cells, this vector can then be used as a reporter 
molecule for GAS binding as described in Examples 16-17. 

Other constructs can be made using the above description and replacing 
GAS with a different promoter sequence. For example, construction of reporter 
molecules containing NFK-B and EGR promoter sequences are described in 
Examples 1 8 and 19. However, many other promoters can be substituted using 
the protocols described in these Examples. For instance, SRE, EL-2, NFAT, or 
Osteocalcin promoters can be substituted, alone or in combination (e.g., 
GAS/NF-KB/EGR, GAS/NF-KB, I1-2/NFAT, orNF-KB/GAS). Similarly, other 
cell lines can be used to test reporter construct activity, such as HELA (epithelial), 
HUVEC (endothelial), Reh (B-cell), Saos-2 (osteoblast), HUVAC (aortic), or 
Cardiomyocyte. 
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Example 16: High-Throughput Screening Assay for T-cell Activity 

The following protocol is used to assess T-cell activity of METH1 or 
METH2 by determining whether METH1 or METH2 supernatant proliferates 
and/or differentiates T-cells. T-cell activity is assessed using the GAS/SEAP/Neo 
construct produced in Example 15. Thus, factors that increase SEAP activity 
indicate the ability to activate the Jaks-STATS signal transduction pathway. The 
T-cell used in this assay is Jurkat T-cells (ATCC Accession No. TIB- 152), 
although Molt-3 cells (ATCC Accession No. CRL-1 552) and Molt-4 cells (ATCC 
Accession No. CRL-1 582) cells can also be used. 

Jurkat T-cells are lymphoblastic CD4+ Thl helper cells. In order to 
generate stable cell lines, approximately 2 million Jurkat cells are transfected with 
the GAS-SEAP/neo vector using DMKIE-C" (Life Technologies)(trahsfection 
procedure described below). The transfected cells are seeded to a density of 
approximately 20,000 cells per well and transfectants resistant to 1 mg/ml genticin 
selected. Resistant colonies are expanded and then tested for their response to 
increasing concentrations of interferon gamma. The dose response of a selected 
clone is demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in 
RPMI + 10% serum with l%Pen-Strep. Combine 2.5 mis of OPTI-MEM (Life 
Technologies) with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI- 
MEM containing 50 ul of DMRIE-C and incubate at room temperature for 1 5-45 
mins. 

During the incubation period, count cell concentration, spin down the 
required number of cells (10 7 per transfection), and resuspend in OPTI-MEM to 
a final concentration of 1 0 7 cells/ml. Then add 1 ml of 1 x 1 0 7 cells in OPTI-MEM 
to T25 flask and incubate at 37 degree C for 6 hrs. After the incubation, add 10 
ml of RPMI + 1 5% serum. 
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The Jurkat:GAS-SEAP stable reporter lines are maintained in RPM1 + 
10% serum, 1 mg/ml Genticin, and 1% Pen-Strep. These cells are treated with 
. supernatants containing METH 1 or METH2 polypeptides or METH 1 or METH2 
induced polypeptides as produced by the protocol described in Example 14. 

On the day of treatment with the supernatant, the cells should be washed 
and resuspended in fresh RPM1 + 1 0% serum to a density of 500,000 cells per ml. 
The exact number of cells required will depend on the number of supernatants 
being screened. For one 96 well plate, approximately 10 million cells (for 10 
plates, 1 00 million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the 
cells into a 96 well dish, using a 1 2 channel pipette. Using a 1 2 channel pipette, 
transfer 200 ul of cells into each well (therefore adding 100,000 cells per well). 

After all the plates have been seeded, 50 ul of the supernatants are 
transferred directly from the 96 well plate containing the supernatants into each 
well using a 12 channel pipette. In addition, a dose of exogenous interferon 
gamma (0. 1 , 1 .0, 1 0 ng) is added to wells H9, HI 0, and HI 1 to serve as additional 
positive controls for the assay. 

The 96 well dishes containing Jurkat cells treated with supernatants are 
placed in an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 
35 ul samples from each well are then transferred to an opaque 96 well plate using 
a 12 channel pipette. The opaque plates should be covered (using sellophene 
covers) and stored at -20 degree C until SEAP assays are performed according to 
Example 20. The plates containing the remaining treated cells are placed at 4 
degree C and serve as a source of material for repeating the assay on a specific 
well if desired. 

As a positive control, 1 00 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in 
the positive control wells. 
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Example J 7: High-Throughput Screening Assay Identifying Myeloid Activity 

The following protocol is used to assess myeloid activity of METH1 or 
METH2 by determining whether METH1 or METH2 proliferates and/or 
differentiates myeloid cells. Myeloid cell activity is assessed using the 
5 GAS/SEAP/Neo construct produced in Example 15. Thus, factors that increase 

SEAP activity indicate the ability to activate the Jaks-STATS signal transduction 
pathway. The myeloid cell used in this assay is U937, a pre-monocyte cell line, 
although TF-1, HL60, or KG1 can be used. 

To transiently transfect U937 cells with the GAS/SEAP/Neo construct 
1 0 produced in Example 1 5, a DEAE-Dextran method (Kharbanda eL al , 1 994, Cell 

Growth & Differentiation 5:259-265) is used. First, harvest 2x1 Oe 7 U937 cells 
and wash with PBS. The U937 cells are usually grown in RPMI 1640 medium 
containing 10% heat-inactivated fetal bovine serum (FBS) supplemented with 100 
units/ml penicillin and 100 mg/ml streptomycin. 
15 Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer 

containing 0.5 mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 1 40 mM 
NaCl, 5 mM KCl, 375 uM ^HPO^HA 1 mM MgCl 2 , and 675 uM CaCl 2 . 
Incubate at 37 degree C for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
20 resuspend in 10 ml complete medium and incubate at 37 degree C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/ml G41 8. The G41 8-free medium is used for routine growth but every one to 
two months, the cells should be re-grown in 400 ug/ml G418 for couple of 
passages. 

25 These cells are tested by harvesting lxl 0 8 cells (this is enough for ten 96- 

well plates assay) and wash with PBS. Suspend the cells in 200 ml above 
described growth medium, with a final density of 5x1 0 5 cells/ml. Plate 200 ul cells 
per well in the 96-well plate (or lxlO 5 cells/well), 
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Add 50 ill of the supernatant prepared by the protocol described in 
Example 1 4. Incubate at 37 degee C for 48 to 72 hr. As a positive control, 1 00 
Unit/ml interferon gamma can be used which is known to activate U937 cells. 
Over 30 fold induction is typically observed in the positive control wells. SEAP 
assay the supernatant according to the protocol described in Example 20. 

Example 1 8: High- Throughput Screening Assay Identifying Neuronal A ctivity 

When cells undergo differentiation and proliferation, a group of genes are 
activated through many different signal transduction pathways. One of these 
genes, EGR1 (early growth response gene 1 ), is induced in various tissues and cell 
types upon activation. The promoter of EGR1 is responsible for such induction. — 
Using the EGR1 promoter linked to reporter molecules, activation of cells can be 
assessed by METH1 or METH2. 

Particularly, the following protocol is used to assess neuronal activity in 
PC12 cell lines. PCI 2 cells (rat phenochromocytoma cells) are known to 
proliferate and/or differentiate by activation with a number of mitogens, such as 
TPA (tetradecanoyl phorbol acetate), NGF (nerve growth factor), and EGF 
(epidermal growth factor). The EGR1 gene expression is activated during this 
treatment. Thus, by stably transfecting PC 12 cells with a construct containing an 
EGR promoter linked to SEAP reporter, activation of PC 12 cells by METH1 or 
METH2 can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following 
protocol. The EGR-1 promoter sequence (-633 to + l)(Sakamoto K et al y 
Oncogene 6:867-873 (1991)) can be PCR amplified from human genomic DNA 
using the following primers: 

5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3' (SEQ ID 

NO:89) 

5' GCGAAGCTTCGCGACTCCCCGG ATCCGCCTC-3 ' (SEQ ID 

NO:90) 
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Using the GAS:SEAP/Neo vector produced in Example 15, EGR1 
amplified product can then be inserted into this vector. Linearize the 
GAS:SEAP/Neo vector using restriction enzymes Xhol/Hindlll, removing the 
GAS/SV40 stuffer. Restrict the EGRI amplified product with these same 
enzymes. Ligate the vector and the EGRI promoter. 

To prepare 96 well-plates for cell culture, two mis of a coating solution 
(1:30 dilution of collagen type I (Upstate Biotech Inc. Cat#08-115) in 30% 
ethanol (filter sterilized)) is added per one 1 0 cm plate or 50 ml per well of the 96- 
well plate, and allowed to air dry for 2 hr. 

PC 12 cells are routinely grown in RPM1-1640 medium (Bio Whittaker) 
containing 1 0% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat-_ 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin 
and 100 ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four 
split is done every three to four days. Cells are removed from the plates by 
scraping and resuspended with pipetting up and down for more than 15 times. 

Transfect the EGR/SEAP/Neo construct into PCI 2 using the 
Lipofectamine protocol described in Example 14. EGR-SEAP/PC12 stable cells 
are obtained by growing the cells in 300 ug/ml G418. The G418-free medium is 
used for routine growth but every one to two months, the cells should be re- 
grown in 300 ug/ml G418 for couple of passages. 

To assay for neuronal activity, a 1 0 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
(Phosphate buffered saline). Then starve the cells in low serum medium (RPM3- 
1640 containing 1% horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. 
Scrape off the cells from the plate, suspend the cells well in 2 ml low serum 
medium. Count the cell number and add more low serum medium to reach final 
cell density as 5x1 0 s cells/ml. 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent 
to lxl 0 s cells/well). Add 50 ul supernatant produced by Example 14, 37 degree 
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C for 48 to 72 hr. As a positive control, a growth factor known to activate PC 12 
cells through EGR can be used, such as 50 ng/ul of Neuronal Growth Factor 
(NGF). Over fifty-fold induction of SEAP is typically seen in the positive control 
wells. SEAP assay the supernatant according to Example 20. 

Example 19: High-Throughput Screening Assay for T-cell Activity 

NF-KB (Nuclear Factor KB) is a transcription factor activated by a wide 
variety of agents including the inflammatory cytokines IL-1 and TNF, CD30 and 
CD40, lympho toxin-alpha and lymphotoxin-beta, by exposure to LPS orthrombin, 
and by expression of certain viral gene products. As a transcription factor, NF-KB 
regulates the expression of genes-involved in immune cell activation, control of 
apoptosis (NF- KB appears to shield cells from apoptosis), B and T-cell 
development, anti-viral and antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF- KB is retained in the cytoplasm with I- 
KB (Inhibitor KB). However, upon stimulation, I- KB is phosphorylated and 
degraded, causing NF- KB to shuttle to the nucleus, thereby activating 
transcription of target genes. Target genes activated by NF- KB include IL-2, 
IL-6, GM-CSF, ICAM-1 and class 1 MHC. 

Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-KB promoter element are used to screen the 
supernatants produced in Example 14. Activators or inhibitors of NF-KB would 
be useful in treating diseases. For example, inhibitors of NF-KB could be used to 
treat those diseases related to the acute or chronic activation of NF-KB, such as 
rheumatoid arthritis. 

To construct a vector containing the NF-KB promoter element, a PCR 
based strategy is employed. The upstream primer contains four tandem copies of 
the NF-KB binding site (GGGGACTTTCCC) (SEQ ID NO:91), 18 bp of 
sequence complementary to the 5' end of the SV40 early promoter sequence, and 
is flanked with an Xhol site: 
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S^GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCG 
GG ACTTTCC ATCCTGCC ATCTC AATTAG : 3 ' (SEQ ID NO:92) 

The downstream primer is complementary to the 3' end of the SV40 
promoter and is flanked with a Hind III site: 

5 ' : GCGGC AAGCTTTTTGC AAAGCCT AGGC : 3 1 (SEQ ED N0.93) 

PCR amplification is performed using the S V40 promoter template present 
in the pB -gal: promoter plasmid obtained from Clontech. The resulting PCR 
fragment is digested with Xhol and Hind III and subcloned into BLSK2-. 
(Stratagene) Sequencing with the T7 and T3 primers confirms the insert contains 
the following sequence: 

S^CTCGAGGGGACT 

TTCCATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAA 
CTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCG 
CCCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGC 
CTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGA 
GGCCTAGGCTTTTGCAAAAAGCTT: 3* (SEQ ID NO:88) 

Next, replace the S V40 minimal promoter element present in the pSE AP2- 
promoter plasmid (Clontech) with this NF-KB/SV40 fragment using Xhol and 
Hindlll. However, this vector does not contain a neomycin resistance gene, and 
therefore, is not preferred for mammalian expression systems. 

In order to generate stable mammalian cell lines, theNF-KB/SV40/SEAP 
cassette is removed from the above NF-KB/SEAP vector using restriction 
enzymes Sail and NotI, and inserted into a vector containing neomycin resistance. 
Particularly, the NF-KB/S V40/SEAP cassette was inserted into pGFP-1 
(Clontech), replacing the GFP gene, after restricting pGFP-1 with Sail and NotI. 

Once NF-KB/S V40/SEAP/Neo vector is created, stable Jurkat T-cells are 
created and maintained according to the protocol described in Example 16. 
Similarly, the method for assaying supernatants with these stable Jurkat T-cells is 
also described in Example 16. As a positive control, exogenous TNF alpha (0.1,1, 
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10 ng) is added to wells H9, H10, and HI 1, with a 5-10 fold activation typically 
observed. 

Example 20: Assay for SEAP Activity 

As a reporter molecule for the assays described in Examples 16-19, SEAP 
activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to 
the following general procedure. The Tropix Phospho-light Kit supplies the 
Dilution, Assay, and Reaction Buffers used below. 

Prime a dispenser with the 2.5x Dilution Buffer and dispense 1 5 ul of 2.5x 
dilution buffer into Optiplates containing 35 ul of a supernatant. Seal the plates 
with a plastic sealer and incubate at 65 degree C for 30 min. - Separate the 
Optiplates to avoid uneven heating. 

Cool the samples to room temperature for 15 minutes. Empty the 
dispenser and prime with the Assay Buffer. Add 50 ml Assay Buffer and incubate 
at room temperature 5 min. Empty the dispenser and prime with the Reaction 
Buffer (see the table below). Add 50 ul Reaction Buffer and incubate at room 
temperature for 20 minutes. Since the intensity of the chemiluminescent signal is 
time dependent, and it takes about 10 minutes to read 5 plates on luminometer, 
one should treat 5 plates at each time and start the second set 10 minutes later. 

Read the relative light unit in the luminometer. Set H 1 2 as blank, and print 
the results. An increase in chemiluminescence indicates reporter activity. 



Reaction Buffer Formulation: 

#of plates Rxn buffer diluent (ml) CSPD (ml) 

10 60 3 

11 65 3.25 

12 70 3.5 

13 75 3.75 

14 80 4 

15 85 4.25 

16 90 4.5 

17 95 4.75 

18 100 5 
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Example 21: High-Throughput Screening Assay Identifying Changes in Small 
Molecule Concentration and Membrane Permeability * 

35 Binding of a ligand to a receptor is known to alter intracellular levels of 

small molecules, such as calcium, potassium, sodium, and pH, as well as alter 
membrane potential. These alterations can be measured in an assay to identify 
supernatants which bind to receptors of a particular cell. Although the following 
protocol describes an assay for calcium, this protocol can easily be modified to 
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detect changes in potassium, sodium, pH, membrane potential, or any other small 
molecule which is detectable by a fluorescent probe. 

The following assay uses Fluorometric Imaging Plate Reader ("FLEPR") 
to measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be 
used instead of the calcium fluorescent molecule, fluo-3, used here. 

For adherent cells, seed the cells at 1 0,000 -20,000 cells/well in a Co-star 
black 96-well plate with clear bottom. The plate is incubated in a C0 2 incubator 
for 20 hours. The adherent cells are washed two times in Biotek washer with 200 
ul of HBSS (Hank's Balanced Salt Solution) leaving 1 00 ul of buffer after the final 
wash. 

— A stock solution of 1 mg/ml -fluo-3-is made in 1 0% pluronie-acid DMSOr - 
To load the cells with fluo-3, 50 ul of 12 ug/ml fluo-3 is added to each well. The 
plate is incubated at 37 degree C in a C0 2 incubator for 60 min. The plate is 
washed four times in the Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells 
are re-suspended to 2-5x1 0 6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 
1 mg/ml fluo-3 solution in 10% pluronic acid DMSO is added to each ml of cell 
suspension. The tube is then placed in a 37 degree C water bath for 30-60 min. 
The cells are washed twice with HBSS, resuspended to lxl 0 6 cells/ml, and 
dispensed into a microplate, 1 00 ul/well. The plate is centrifiiged at 1000 rpm for 
5 min. The plate is then washed once in Denley CellWash with 200 ul, followed 
by an aspiration step to 100 ul final volume. 

For a non-cell based assay, each well contains a fluorescent molecule, such 
as fluo-3 . The supernatant is added to the well, and a change in fluorescence is 
detected. 

To measure the fluorescence of intracellular calcium, the FLIPR is set for 
the following parameters: (1) System gain is 300-800 mW; (2) Exposure time 
is 0.4 second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 
530 nm; and (6) Sample addition is 50 ul. Increased emission at 530 nm indicates 
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an extracellular signaling event caused by the a molecule, either METH1 or 
METH2 or a molecule induced by METH1 or METH2, which has resulted in an 
increase in the intracellular Ca^ concentration. 

Example 22: High-Throughput Screening Assay Identifying Tyrosine Kinase 
Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine 
Kinase RPTK) group are receptors for a range of mitogenic and metabolic growth 
factors including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor 
subfamilies. In addition there are a large family _ of RPTKs for which the 
corresponding ligand is unknown. Ligands for RPTKs include mainly secreted 
small proteins, but also membrane-bound and extracellular matrix proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor 
dimerization, resulting in transphosphorylation of the receptor subunits and 
activation of the cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases 
include receptor associated tyrosine kinases of the src-family (e.g., src, yes, Ick, 
lyn, fyn) and non-receptor linked and cytosolic protein tyrosine kinases, such as 
the Jak family, members of which mediate signal transduction triggered by the 
cytokine superfamily of receptors (e.g., the Interleukins, Interferons, GM-CSF, 
and Leptin). 

Because of the wide range ofknown factors capable of stimulating tyrosine 
kinase activity, identifying whether METH1 or METH2 or a molecule induced by 
METH1 or METH2 is capable of activating tyrosine kinase signal transduction 
pathways is of interest. Therefore, the following protocol is designed to identify 
such molecules capable of activating the tyrosine kinase signal transduction 
pathways. 

Seed target cells (e.g., primary keratinocytes) at a density of approximately 
25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses 
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with 100% ethanol, rinsed with water and dried overnight. Some plates are 
coated for 2 hr with 100 ml of cell culture grade type 1 collagen (50 mg/ml), 
gelatin (2%) or polylysine (50 mg/ml), all of which can be purchased from Sigma 
Chemicals (St. Louis, MO) or 10% Matrigel purchased from Becton Dickinson 
(Bedford,MA), or calf serum, rinsed with PBS and stored at 4 degree C Cell 
growth on these plates is assayed by seeding 5,000 cells/well in growth medium 
and indirect quantitation of cell number through use of alamarBlue as described 
by the manufacturer Alamar Biosciences, Inc. (Sacramento, CA) after 48 hr. 
Falcon plate covers #3071 from Becton Dickinson (Bedford,MA) are used to 
cover the Loprodyne Silent Screen Plates. Falcon Microtest III cell culture plates 
can also be used in some proliferation experiments. 

~ To prepare extracts, A43 T cells are seeded onto the nylon membranes "of 
Loprodyne plates (20,000/200ml/well) and cultured overnight in complete 
medium. Cells are quiesced by incubation in serum-free basal medium for 24 hr. 
After 5-20 minutes treatment with EOF (60ng/ml) or 50 ul of the supernatant 
produced in Example 14, the medium was removed and 100 ml of extraction 
buffer ((20 mM HEPES pH 7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 
mM Na3 V04, 2 mM Na4P207 and a cocktail of protease inhibitors (# 1 83 6 1 70) 
obtained from Boehringer Mannheim (Indianapolis, IN) is added to each well and 
the plate is shaken on a rotating shaker for 5 minutes at 4°C. The plate is then 
placed in a vacuum transfer manifold and the extract filtered through the 0.45 mm 
membrane bottoms of each well using house vacuum. Extracts are collected in 
a 96-well catch/assay plate in the bottom of the vacuum manifold and immediately 
placed on ice. To obtain extracts clarified by centrifiigation, the content of each 
well, after detergent solubilization for 5 minutes, is removed and centrifuged for 
1 5 minutes at 4 degree C at 1 6,000 x g. 

Test the filtered extracts for levels of tyrosine kinase activity. Although 
many methods of detecting tyrosine kinase activity are known, one method is 
described here. 
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Generally, the tyrosine kinase activity of a supernatant is evaluated by 
determining its ability to phosphorylate a tyrosine residue on a specific substrate 
(a biotinylated peptide). Biotinylated peptides that can be used for this purpose 
include PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2- 
p34) and PSK2 (corresponding to amino acids 1 - 1 7 of gastrin). Both peptides are 
substrates for a range of tyrosine kinases and are available from Boehringer 
Mannheim. 

The tyrosine kinase reaction is set up by adding the following components 
in order, First, add lOul of 5uM Biotinylated Peptide, then lOut ATP/Mg 2 * (5mM 
ATP/50mM MgC12), then lOul of 5x Assay Buffer (40mM imidazole 
hydrochloride^ pH7.3, 40 mM beta-glycerophosphate, ImM EGTA, lOOmM 
MgCl 2 , 5 mM MnCl 2 , 0.5 mg/ml BSA), then 5ul of Sodium Vanadate(lmM), and 
then 5ul of water. Mix the components gently and preincubate the reaction mix 
at 30 degree C for 2 min. Initial the reaction by adding 1 Oul of the control enzyme 
or the filtered supernatant. 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 
120mm EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of 
reaction mixture to a microtiter plate (MTP) module and incubating at 37 degree 
C for 20 min. This allows the streptavadin coated 96 well plate to associate with 
the biotinylated peptide. Wash the MTP module with 300ul/well of PBS four 
times. Next add 75 ul of anti-phospotyrosine antibody conjugated to horse radish 
peroxidase(anti-P-Tyr-POD(0.5u/ml)) to each well and incubate at 37 degree C 
for one hour. Wash the well as above. 

Next add 1 OOul of peroxidase substrate solution (Boehringer Mannheim) 
and incubate at room temperature for at least 5 mins (up to 30 min). Measure the 
absorbance of the sample at 405 nm by using ELISA reader. The level of bound 
peroxidase activity is quantitated using an ELISA reader and reflects the level of 
tyrosine kinase activity. 
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Example 23: High-Throughput Screening Assay Identifying Phosphorylation 
Activity 

As a potential alternative and/or compliment to the assay of protein 
tyrosine kinase activity described in Example 22, an assay which detects activation 
(phosphorylation) of major intracellular signal transduction intermediates can also 
be used. For example, as described below one particular assay can detect tyrosine 
phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of 
other molecules, such as Raf, INK, p3 8 MAP, Map kinase kinase (MEK), MEK 
kinase, Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any. 
other phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be 
detected by substituting these molecules for Erk- 1 or Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96- well ELIS A 
plate with 0. 1ml of protein G (lug/ml) for 2 hr at room temp, (RT). The plates 
are then rinsed with PBS and blocked with 3% BS A/PBS for 1 hr at RT. The 
protein G plates are then treated with 2 commercial monoclonal antibodies 
( lOOng/well) against Erk-1 and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). 
(To detect other molecules, this step can easily be modified by substituting a 
monoclonal antibody detecting any of the above described molecules.) After 3-5 
rinses with PBS, the plates are stored at 4 degree C until use. 

A43 ] cells are seeded at 20,000/weIl in a 96-well Loprodyne filterplate and 
cultured overnight in growth medium. The cells are then starved for 48 hr in basal 
medium (DMEM) and then treated with EGF (6ng/well) or 50 ul of the 
supernatants obtained in Example 14 for 5-20 minutes. The cells are then 
solubilized and extracts filtered directly into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. 
As a positive control, a commercial preparation of MAP kinase (1 Ong/well) is used 
in place 

of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) 
antibody (lug/ml) which specifically recognizes the phosphorylated epitope of the 
Erk-1 and Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard 
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procedures. The bound polyclonal antibody is then quantitated by successive 
incubations with Europium-streptavidin and Europium fluorescence enhancing 
reagent in the Wallac DELFIA instrument (time-resolved fluorescence). An 
increased fluorescent signal over background indicates a phosphorylation by 
METH1 or METH2 or a molecule induced by METH1 or METH2. 

Example 24: Method of Determining Alterations in the METH1 or METH2 
Gene 

RNA isolated from entire families or individual patients presenting with a 
phenotype of interest (such as a disease) is be isolated. cDNA is then generated 
from these RNA samples using protocols known in the art. (See,- Sambrook.) 
The cDNA is then used as a template for PCR, employing primers surrounding 
regions of interest in SEQ ID NO: 1 . Suggested PCR conditions consist of 35 
cycles at 95 degree C for 30 seconds; 60-120 seconds at 52-58 degree C; and 60- 
120 seconds at 70 degree C, using buffer solutions described in Sidransky, D. et 
al., Science 252:106 (1991). 

PCR products are then sequenced using primers labeled at their 5' end with 
T4 polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre 
Technologies). The intron-exon borders of selected exons of METH 1 or METH2 
is also determined and genomic PCR products analyzed to confirm the results. 
PCR products harboring suspected mutations in METH1 or METH2 is then 
cloned and sequenced to validate the results of the direct sequencing. 

PCR products of METH 1 or METH2 are cloned into T-tailed vectors as 
described in Holton, T.A. and Graham, M.W., Nucleic Acids Research 19\\ 156 
(1 991 ) and sequenced with T7 polymerase (United States Biochemical). Affected 
individuals are identified by mutations in METH1 or METH2 not present in 
unaffected individuals. 

Genomic rearrangements are also observed as a method of determining 
alterations in the METH1 or METH2 gene. Isolated genomic clones are nick- 
translated with digoxigenindeoxy-uridine 5-triphosphate (Boehringer Manheim), 
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and FISH performed as described in Johnson, Cg. et al., Methods Cell Biol. 35:73- 
99 ( 1 99 1 ). Hybridization with the labeled probe is carried out using a vast excess 
of human cot-1 DNA for specific hybridization to the METH1 or METH2 
genomic locus. 

5 Chromosomes are counterstained with 4,6-diamino-2-phenyIidole and 

propidium iodide, producing a combination of C- and R-bands. Aligned images 
for precise mapping are obtained using a triple-band filter set (Chroma 
Technology, Brattleboro, VT) in combination with a cooled charge-coupled 
device camera (Photometries, Tucson, AZ) and variable excitation wavelength 

10 filters. (Johnson, Cv. et aL y Genet. Anal Tech. Appl 8:15 (1991).) Image 

collection, analysis and chromosomal fractional length measurements are 
performed~using the TSee GraphicaTProgram System. (Inovision Corporation, 
Durham, NC.) Chromosome alterations of the genomic region of METH1 or 
METH2 (hybridized by the probe) are identified as insertions, deletions, and 

15 translocations. These METH1 or METH2 alterations are used as a diagnostic 

marker for an associated disease. 

Example 25: Method of Detecting Abnormal Levels ofMETHl orMETHl in 
a Biological Sample 

METH1 or METH2 polypeptides can be detected in a biological sample, 
20 and if an increased or decreased level ofMETHl or METH2 is detected, this 

polypeptide is a marker for a particular phenotype. Methods of detection are 
numerous, and thus, it is understood that one skilled in the art can modify the 
following assay to fit their particular needs. 

For example, antibody-sandwich ELISAs are used to detect METH1 or 
25 METH2 in a sample, preferably a biological sample. Wells of a microliter plate 

are coated with specific antibodies to METH1 or METH2, at a final concentration 
of 0.2 to 10 ug/ml. The antibodies are either monoclonal or polyclonal and are 
produced by the method described in Example 13. The wells are blocked so that 
non-specific binding ofMETHl or METH2 to the well is reduced. 
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The coated wells are then incubated for > 2 hours at RT with a sample 
containing METH1 or METH2. Preferably, serial dilutions of the sample should 
be used to validate results. The plates are then washed three times with deionized 
or distilled water to remove unbounded METH1 or METH2. 

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a 
concentration of 25-400 ng, is added and incubated for 2 hours at room 
temperature. The plates are again washed three times with deionized or distilled 
water to remove unbounded conjugate. 

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl 
phosphate (NPP) substrate solution to each well and incubate 1 hour at room 
temperature. Measure the .reaction by a microliter plate reader. Prepare a 
standard curve, using serial dilutions of a control sample, and plot METH1 or 
METH2 polypeptide concentration on the X-axis (log scale) and fluorescence or 
absorbance of the Y-axis (linear scale). Interpolate the concentration of the 
METH1 or METH2 in the sample using the standard curve. 

Example 26: Formulating a Polypeptide 

The METH1 or METH2 composition will be formulated and dosed in a 
fashion consistent with good medical practice, taking into account the clinical 
condition of the individual patient (especially the side effects of treatment with the 
METH1 or METH2 polypeptide alone), the site of delivery, the method of 
administration, the scheduling of administration, and other factors known to 
practitioners. The "effective amount" for purposes herein is thus determined by 

such considerations. 

As a general proposition, the total pharmaceutical^ effective amount of 
METHl or METH2 administered parenteral^ per dose will be in the range of 
about lug/kg/day to 10 mg/kg/day of patient body weight, although, as noted 
above, this will be subject to therapeutic discretion. More preferably, this dose is 
at least 0.01 mg/kg/day, and most preferably for humans between about 0.01 and 
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1 mg/kg/day for the hormone. If given continuously, METH1 or METH2 is 
typically administered at a dose rate of about 1 ug/kg/hour to about 50 
ug/kg/hour, either by 1-4 injections per day or by continuous subcutaneous 
infusions, for example, using a mini-pump. An intravenous bag solution may also 
be employed. The length of treatment needed to observe changes and the interval 
following treatment for responses to occur appears to vary depending on the 
desired effect. 

Pharmaceutical compositions containing METH1 or METH2 are 
administered orally, rectally, parenterally, intracistemally, intravaginally, 
intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal 
patch), bucally, or as an oral or nasal spray, "Pharmaceutically acceptable carrier" 
refers to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material 
or formulation auxiliary of any type. The term "parenteral" as used herein refers 
to modes of administration which include intravenous, intramuscular, 
intraperitoneal, intrasternal, subcutaneous and intraarticular injection and infusion. 

METH1 or METH2 is also suitably administered by sustained-release 
systems. Suitable examples of sustained-release compositions include semi- 
permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules. Sustained-release matrices include polylactides (U.S. Pat. No. 
3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl-L- 
glutamate (Sidman, U. et al., Biopolymers 22:541-556 (1983)), poly (2- 
hydroxyethyl methacrylate) (R. Langer et al. , J. Biomed Mater. Res. 15 A 67-277 
(1 981), and R. Langer, Chem. Tech. 72:98-105 (1982)), ethylene vinyl acetate (R. 
Langer et al.) or poIy-D- (-)-3-hydroxybutyric acid (EP 133,988). Sustained- 
release compositions also include liposomally entrapped METH1 or METH2 
polypeptides. Liposomes containing the METH1 or METH2 are prepared by 
methods known per se: DE 3 ,2 1 8, 1 2 1 ; Epstein et al. , Proc. Natl. A cad. Sci. USA 
52:3688-3692 (1985); Hwang etal, Proc. Natl. Acad. Sci. USA 77:4030-4034 
(1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese 
Pat. Appl. 83-1 1 8008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 1 02,324. 
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Ordinarily, the liposomes are of the small (about 200-800 Angstroms) unilamellar 
type in which the lipid content is greater than about 30 mol. percent cholesterol, 
the selected proportion being adjusted for the optimal secreted polypeptide 

therapy. 

For parenteral administration, in one embodiment, METH1 or METH2 is 
formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutical* 
acceptable carrier, i.e., one that is non-toxic to recipients at the dosages and 
concentrations employed and is compatible with other ingredients of the 
formulation. For example, the formulation preferably does not include oxidizing 
agents and other compounds that are known to be deleterious to polypeptides. 

Generally, the formulations are prepared by contacting METH1 or 
METH2 uniformly and intimately with liquid carriers or finely divided solid 
carriers or both. Then, if necessary, the product is shaped into the desired 
formulation. Preferably the carrier is a parenteral carrier, more preferably a 
solution that is isotonic with the blood of the recipient. Examples of such carrier 
vehicles include water, saline, Ringer's solution, and dextrose solution. Non- 
aqueous vehicles such as fixed oils and ethyl oleate are also useful herein, as well 
as liposomes. 

The carrier suitably contains minor amounts of additives such as 
substances that enhance isotonicity and chemical stability. Such materials are non- 
toxic to recipients at the dosages and concentrations employed, and include 
buffers such as phosphate, citrate, succinate, acetic acid, and other organic acids 
or their salts; antioxidants such as ascorbic acid; low molecular weight (less than 
about ten residues) polypeptides, e.g., polyargmine or tripeptides; proteins, such 
as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, 
or arginine; monosaccharides, disaccharides, and other carbohydrates including 
cellulose or its derivatives, glucose, manose, or dextrins; chelating agents such as 
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EDTA; sugar alcohols such as mannitol or sorbitol; counterfoils such as sodium; 

and/or nonionic surfactants such as polysorbates, poloxamers, or PEG. 

METH1 or METH2 is typically formulated in such vehicles at a 

concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at apH 
5 of about 3 to 8. It will be understood that the use of certain of the foregoing 

excipients, carriers, or stabilizers will result in the formation of polypeptide salts. 
METH1 or METH2 used for therapeutic administration can be sterile. 

Sterility is readily accomplished by filtration through sterile filtration membranes 

(e.g., 0.2 micron membranes). Therapeutic polypeptide compositions generally 
]0 are placed into a container having a sterile access port, for example, an 

intravenous solution bag or vial having a stopper pierceable by a hypodermic 

injection needle. 

METH1 or METH2 polypeptides ordinarily will be stored in unit or multi- 
dose containers, for example, sealed ampoules or vials, as an aqueous solution or 

15 as a lyophiiized formulation for reconstitution. As an example of a lyophilized 

formulation, 10-ml vials are filled with 5 ml of sterile-filtered 1% (w/v) aqueous 
METH1 or METH2 polypeptide solution, and the resulting mixture is lyophilized. 
The infusion solution is prepared by reconstituting the lyophilized METH1 or 
METH2 polypeptide using bacteriostatic Water-for-Injection. 

20 The invention also provides a pharmaceutical pack or kit comprising one 

or more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such container(s) can be a notice 
in the form prescribed by a governmental agency regulating the manufacture, use 
or sale of pharmaceuticals or biological products, which notice reflects approval 

25 by the agency of manufacture, use or sale for human administration. In addition, 

METH1 or METH2 may be employed in conjunction with other therapeutic 
compounds. 
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Example 2 7: Method of Treating Decreased Levels ofMETHJ or METH2 
The present invention relates to a method for treating an individual in need 

of a decreased level of METH1 or METH2 activity in the body comprising, 

administering to such an individual a composition comprising a therapeutically 
5 effective amount of METH1 orMETH2 antagonist. Preferred antagonists for use 

in the present invention are METH1 or METH2-specific antibodies. 

Moreover, it will be appreciated that conditions caused by a decrease in 

the standard or normal expression level of METH1 or METH2 in an individual can 

be treated by administering METH1 or METH2, preferably in the secreted form. 
1 o Thus, the invention also provides a method of treatment of an individual in need 

of an increased level of METH lor METH2 polypeptide comprising administering 

to such an individual a pharmaceutical composition comprising an amount of 
METH1 or METH2 to increase the activity level of METH 1 or METH2 in such 
an individual . 

15 For example, a patient with decreased levels of METH1 or METH2 

polypeptide receives a daily dose 0.1-100 ug/kg of the polypeptide for six 
consecutive days. Preferably, the polypeptide is in the secreted form. The exact 
details of the dosing scheme, based on administration and formulation, are 
provided in Example 26. 

20 Example 28: Method of Treating Increased Levels ofMETHl or METH2 

The present invention also relates to a method for treating an individual in 
need of an increased level ofMETHl or METH2 activity in the body comprising 
administering to such an individual a composition comprising a therapeutically 
effective amount ofMETHl or METH2 or an agonist thereof. 
25 Antisense technology is used to inhibit production of METH 1 or METH2. 

This technology is one example of a method of decreasing levels ofMETHl or 
METH2 polypeptide, preferably a secreted form, due to a variety of etiologies, 
such as cancer. 
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For example, a patient diagnosed with abnormally increased levels of 
METH1 or METH2 is administered intravenously antisense polynucleotides at 
05,0,1.5,2.0 and 3 .0 mg/kg day for 2 1 days. This treatment is repeated after 
a 7-day rest period if the treatment was well tolerated. The formulation of the 
antisense polynucleotide is provided in Example 26. 

Example 29: Method of Treatment Using Gene Therapy - Ex Vivo 

One method of gene therapy transplants fibroblasts, which are capable of 
expressing METH1 or METH2 polypeptides, onto a patient. Generally, 
fibroblasts are obtained from a subject by skin biopsy. The resulting tissue is 
placed in tissue-culture medium and separated into small pieces. Small chunks of 
the tissue are placed on a wet surface of a tissue culture flask, approximately ten 
pieces are placed in each flask. The flask is turned upside down, closed tight and 
left at room temperature over night. After 24 hours at room temperature, the 
flask is inverted and the chunks of tissue remain fixed to the bottom of the flask 
and fresh media (e.g., Ham's F12 media, with 10% FBS, penicillin and 
streptomycin) is added. The flasks are then incubated at 37 degree C for 

approximately one week. 

At this time, fresh media is added and subsequently changed every several 
days. After an additional two weeks in culture, a monolayer of fibroblasts emerge. 
The monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et aL, DM 7:219-25 (1988)), flanked by the 
long terminal repeats of the Moloney murine sarcoma virus, is digested with 
EcoRI and HindlH and subsequently treated with calf intestinal phosphatase. The 
linear vector is fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding METH1 or METH2 can be amplified using PCR 
primers which correspond to the 5' and 3' end sequences respectively as set forth 
in Example 5. Preferably, the 5' primer contains an EcoRI site and the 3' primer 
includes a HindHI site. Equal quantities of the Moloney murine sarcoma virus 
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linear backbone and the amplified EcoRI and Hindlll fragment are added together, 
in the presence of T4 DNA ligase. The resulting mixture is maintained under 
conditions appropriate for ligation of the two fragments. The ligation mixture is 
then used to transform bacteria HB101, which are then plated onto agar 
containing kanamycin for the purpose of confirming that the vector contains 
properly inserted METH1 or METH2. 

The amphotropic pA3 1 7 or GP+aml 2 packaging cells are grown in tissue 
culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 
10% calf serum (CS), penicillin and streptomycin. The MSV vector containing 
the METH1 or METH2 gene is then added to the media and the packaging cells 
transduced with the vector. _ The packaging cells now produce infectious viral 
particles containing the METH1 or METH2 gene(the packaging cells are now 
referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, 
the media is harvested from a 1 0 cm plate of confluent producer cells. The spent 
media, containing the infectious viral particles, is filtered through a millipore filter 
to remove detached producer cells and this media is then used to infect fibroblast 
cells. Media is removed from a sub-confluent plate of fibroblasts and quickly 
replaced with the media from the producer cells. This media is removed and 
replaced with fresh media. If the titer of virus is high, then virtually all fibroblasts 
will be infected and no selection is required. If the titer is very low, then it is 
necessary to use a retroviral vector that has a selectable marker, such as neo or 
his. Once the fibroblasts have been efficiently infected, the fibroblasts are analyzed 
to determine whether METH1 or METH2 protein is produced. 

The engineered fibroblasts are then transplanted onto the host, either alone 
or after having been grown to confluence on cytodex 3 microcarrier beads. 
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Example 30: Method of Treatment Using Gene Therapy - In Vivo 

Another aspect of the present invention is using in vivo gene therapy 
methods to treat disorders, diseases and conditions. The gene therapy method 
relates to the introduction of naked nucleic acid (DN A, RN A, and antisense DNA 
5 or RN A) METH 1 or METH2 sequences into an animal to increase or decrease the 

expression of the METH1 or METH2 polypeptide. The METH1 or METH2 
polynucleotide may be operatively linked to a promoter or any other genetic 
elements necessary for the expression of the METH1 or METH2 polypeptide by 
the target tissue. Such gene therapy and delivery techniques and methods are 
-,0 k „ovvnintheart r se e ,--forexample,WO90/11092,Wp98/11779; U.S. Patent "NO. 

5693622, 57051 51, 5580859; TabataRe/a/. (1997) Cardiovasc. Res. 35(3)A10- 
479, Chao, J et al. (1997) Pharmacol. Res. 55^:517-522, Wolff J.A. (1997) 
Nwromuscul. Disord 7(5)314-31*, Schwartz, B. et al. (1996) Gene Ther. 
3(5)A05-4U, Tsurumi Y. et al. (1996) Circulation 94(12)32*1-3290 

1 5 (incorporated herein by reference) . 

The METH1 or METH2 polynucleotide constructs may be delivered by 
any method that delivers injectable materials to the cells of an animal, such as, 
injection into the interstitial space of tissues (heart, muscle, skin, lung, liver, 
intestine and the like). The METH1 or METH2 polynucleotide constructs can be 
20 delivered in a pharmaceutical^ acceptable liquid or aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers to sequences that 
are free from any delivery vehicle that acts to assist, promote, or facilitate entry 
into the cell, including viral sequences, viral particles, liposome formulations, 
lipofectin or precipitating agents and the like. However, the METH 1 or METH2 
25 polynucleotides may also be delivered in liposome formulations (such as those 

taught inFelgnerP.L. elal. (1995) Ann. NY Acad. Sci. 772:126-139 and Abdallah 
B. et al. (1995) Biol. Cell 85(1):\-1) which can be prepared by methods well 
known to those skilled in the art. 
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The METH1 or METH2 polynucleotide vector constructs used in the 
gene therapy method are preferably constructs that will not integrate into the host 
genome nor will they contain sequences that allow for replication. Any strong 
promoter known to those skilled in the art can be used for driving the expression 
of DMA. Unlike other gene therapies techniques, one major advantage of 
introducing naked nucleic acid sequences into target cells is the transitory nature 
of the polynucleotide synthesis in the cells. Studies have shown that non- 
replicating DNA sequences can be introduced into cells to provide production of 
the desired polypeptide for periods of up to six months. 

The METH1 or METH2 polynucleotide construct can be delivered to the 
Tnterstitiai space ; Q f tissues within the an animal, including of muscle, skin, brain, 
lung, liver, spleen, bone marrow, thymus, heart, lymph, blood, bone, cartilage, 
pancreas, kidney, gall bladder, stomach, intestine, testis, ovary, uterus, rectum, 
nervous system, eye, gland, and connective tissue. Interstitial space of the tissues 
comprises the intercellular fluid, mucopolysaccharide matrix among the reticular 
fibers of organ tissues, elastic fibers in the walls of vessels or chambers, collagen 
fibers of fibrous tissues, or that same matrix within connective tissue ensheathing 
muscle cells or in the lacunae of bone. It is similarly the space occupied by the 
plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery 
to the interstitial space of muscle tissue is preferred for the reasons discussed 
below. They may be conveniently delivered by injection into the tissues 
comprising these cells. They are preferably delivered to and expressed in 
persistent, non-dividing cells which are differentiated, although delivery and 
expression maybe achieved in non-differentiated or less completely differentiated 
cells, such as, for example, stem cells of blood or skin fibroblasts. In vivo muscle 
cells are particularly competent in their ability to take up and express 
polynucleotides. 

For the naked METH1 or METH2 polynucleotide injection, an effective 
dosage amount of DNA or RNA will be in the range of from about 0.05 g/kg body 
weight to about 50 mg/kg body weight. Preferably the dosage will be from about 
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0.005 mg/kg to about 20 mg/kg and more preferably from about 0.05 mg/kg to 
about 5 mg/kg. Of course, as the artisan of ordinary skill will appreciate, this 
dosage will vary according to the tissue site of injection. The appropriate and 
effective dosage of nucleic acid sequence can readily be determined by those of 
ordinary skill in the art and may depend on the condition being treated and the 
route of administration. The preferred route of administration is by the parenteral 
route of injection into the interstitial space of tissues. However, other parenteral 
routes may also be used, such as, inhalation of an aerosol formulation particularly 
for delivery to lungs or bronchial tissues, throat or mucous membranes of the 
nose. In addition, naked METH1 or METH2 polynucleotide constructs can be 
delivered to arteries during angioplasty by the catheter used in the procedure. 

The dose response effects of injected METH1 or METH2 polynucleotide 
in muscle in vivo is determined as follows. Suitable METH 1 or METH2 template 
DNA for production of mRNA coding for METH1 or METH2 polypeptide is 
prepared in accordance with a standard recombinant DNA methodology. The 
template DNA, which may be either circular or linear, is either used as naked 
DNA or complexed with liposomes. The quadriceps muscles of mice are then 
injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1 .5 cm incision is made 
on the anterior thigh, and the quadriceps muscle is directly visualized. The 
METH1 or METH2 template DNA is injected in 0. 1 ml of carrier in a 1 cc syringe 
through a 27 gauge needle over one minute, approximately 0.5 cm from the distal 
insertion site of the muscle into the knee and about 0.2 cm deep. A suture is 
placed over the injection site for future localization, and the skin is closed with 
stainless steel clips. \ 

After an appropriate incubation time (e.g., 7 days) muscle extracts are 
prepared by excising the entire quadriceps. Every fifth 1 5 um cross-section of the 
individual quadriceps muscles is histochemically stained for METH 1 or METH2 
protein expression. A time course for METH 1 or METH2 protein expression may 
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be done in a similar fashion except that quadriceps from different mice are harvested at 
different times. Persistence of METH1 or METH2 DNA in muscle following injection 
may be determined by Southern blot analysis after preparing total cellular DNA and 
HIRT supernatants from injected and control mice. The results of the above 
5 experimentation in mice can be used to extrapolate proper dosages and other treatment 
. parameters in humans and other animals using METH1 or METH2 naked DNA. 

It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. 

Numerous modifications and variations of the present invention are possible in light of 
10 the above teachings and, therefore, are within the scope of the appended claims. 

The entire disclosure of all publications (including patents, patent applications, journal 
articles, laboratory manuals, books, or other documents) cited herein are hereby 
incorporated-by reference. — - - 

Throughout the specification, unless the context requires otherwise, the word "comprise" 
15 or variations such as "comprises" or "comprising", will be understood to imply the 
inclusion of a stated integer or group of integers but not the exclusion of any other 
integer or group of integers. 
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SEQUENCE LISTING 



<110> Iruela-Arispe, Luisa 
Hastings, Gregg A. 
Ruben, Steven M. 

<120> Methl and Meth2 Polynucleotides and Polypeptides 



<130> 1488.107PC02 



<140> 
<141> 

<150> 60/072,298 
<151> 1998-01-23 

<150> 60/098,539 
<151> 1998-08-28 

<160> 93 



<170> Patentln Ver. 2.0 

<210> 1 

<211> 3261 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (2853) 



<220> 

<221> UNSURE 
<222> (3.095) 
<223> May be 



any nucleic acid 



<220> 

<221> UNSURE 
<222> (3248) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (3255) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (3261) 

<223> May be any nucleic acid 



<400> 1 

atg ggg aac gcg gag egg get ccg ggg tct egg age ttt ggg ccc gta 48 
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Met Gly Asn Ala Glu Arg Ala Pro Gly Ser Arg Ser Phe Gly Pro Val 
15 10 15 

ccc acg ctg ctg ctg etc gec gcg gcg eta ctg gec gtg teg gac gca 96 
Pro Thr Leu Leu Leu Leu Ala Ala Ala Leu Leu Ala Val Ser Asp Ala 
20 25 30 

etc ggg cgc ccc tec gag gag gac gag gag eta gtg gtg ccg gag ctg 14 4 
Leu Gly Arg Pro Ser Glu Glu Asp Glu Glu Leu Val Val Pro Glu Leu 
35 40 45 

gag cgc gee ccg gga cac ggg acc acg cgc etc cgc ctg cac gee ttt 192 
Glu Arg Ala Pro Gly His Gly Thr Thr Arg Leu Arg Leu His Ala Phe 
50 55 60 

gac cag cag ctg gat ctg gag ctg egg ccc gac age age ttt ttg gcg 240 
Asp Gin Gin Leu Asp Leu Glu Leu Arg Pro Asp Ser Ser Phe Leu Ala 
65 70 75 80 

ccc ggc ttc acg etc cag aac gtg ggg cgc aaa tec ggg tec gag acg 288 
Pro Gly Phe Thr Leu Gin Asn Val Gly Arg Lys Ser Gly Ser Glu Thr 
85 90 95 



ccg ctt ccg gaa acc gac ctg gcg cac tgc ttc tac tec ggc acc gtg 
Pro Leu Pro Glu Thr Asp Leu Ala His Cys Phe Tyr Ser Gly Thr Val 
100 105 110 

aat ggc gat ccc age teg get gec gee etc age etc tgc gag ggc gtg 
Asn Gly Asp Pro Ser Ser Ala Ala Ala Leu Ser Leu Cys Glu Gly Val 
115 120 125 

cgc ggc gee ttc tac ctg ctg ggg gag gcg tat ttc ate cag ccg ctg 
Arg Gly Ala Phe Tyr Leu Leu Gly Glu Ala Tyr Phe lie Gin Pro Leu 
130 135 140 



ccg gca cca eta cag ttc cac etc ctg egg egg aat egg cag ggc gac 
Pro Ala Pro Leu Gin Phe His Leu Leu Arg Arg Asn Arg Gin Gly Asp 
165 170 175 



aaa gcg gag acc gaa gac gag gac gaa ggg act gag ggc gag gac gaa 
Lys Ala Glu Thr Glu Asp Glu Asp Glu Gly Thr Glu Gly Glu Asp Glu 
195 200 205 



336 



384 



432 



ccc gee gec age gag cgc etc gee acc gec gec cca ggg gag aag ccg 4 80 
Pro Ala Ala Ser Glu Arg Leu Ala Thr Ala Ala Pro Gly Glu Lys Pro 
145 150 155 160 



528 



gta ggc ggc acg tgc ggg gtc gtg gac gac gag ccc egg ccg act ggg 57 6 
Val Gly Gly Thr Cys Gly Val Val Asp Asp Glu Pro Arg Pro Thr Gly 
180 185 190 



624 



ggg cct cag tgg teg ccg cag gac ccg gca ctg caa ggc gta gga cag 



672 
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Gly 



Pro Gin Trp Ser Pro Gin Asp Pro Ala Leu Gin Gly Val Gly Gin 



210 



215 220 



ccc aca gga act gga age ata aga aag aag cga ttt gtg tec agt cac 
Pro Thr Gly Thr Gly Ser lie Arg Lys Lys Arg Phe Val Ser Ser His 

230 235 240 



225 



cgc tat gtg gaa acc atg ctt gtg gca gac cag teg atg gca gaa ttc 
Arg Tyr Val Glu Thr Met Leu Val Ala Asp Gin Ser Met Ala Glu Phe 
245 250 255 

cac ggc agt ggt eta aag cat tac ctt etc acg ttg ttt teg gtg gca 
His Gly Ser Gly Leu Lys His Tyr Leu Leu Thr Leu Phe Ser Val Ala 



260 



265 270 



gee aga ttg tac aaa cac ccc.agc att cgt aat tea gtt age ctg gtg 
Ala Arg Leu Tyr Lys His Pro Ser He Arg Asn Ser Val Ser Leu Val 
275 280 285 

gtg gtg aag ate ttg gtc ate cac gat gaa cag aag ggg ccg gaa gtg 
Val Val Lys He Leu Val He His Asp Glu Gin Lys Gly Pro Glu Val 



290 



295 " 300 



gaa tta ggc cac gtg ttt aac atg cca cat gat gat gca aag cag tgt 
Glu Leu Gly His Val Phe Asn Met Pro His Asp Asp Ala Lys Gin Cys 

one A n A 



385 



390 395 400 



gee age ctt aat ggt gtg aac cag gat tec cac atg atg gcg tea atg 
Ala Ser Leu Asn Gly Val Asn Gin Asp Ser His Met Met Ala Ser Met 
405 410 415 



720 



768 



816 



864 



912 



960 



acc tec aat get gec etc act ctg egg aac ttt tgc aac tgg cag aag 
Thr Ser Asn Ala Ala Leu Thr Leu Arg Asn Phe Cys Asn Trp Gin Lys 
305 310 315 320 

cag cac aac cca ccc agt gac egg gat gca gag cac tat gac aca gca 
Gin His Asn Pro Pro Ser Asp Arg Asp Ala Glu His Tyr Asp Thr Ala 
325 330 335 

att ctt ttc acc aga cag gac ttg tgt ggg tec cag aca tgt gat act 
He Leu Phe Thr Arg Gin Asp Leu Cys Gly Ser Gin Thr Cys Asp Thr 
340 345 350 

ctt ggg atg get gat gtt gga act gtg tgt gat ccg age aga age tgc 
Leu Gly Met Ala Asp Val Gly Thr Val Cys Asp Pro Ser Arg Ser Cys 
355 360 365 

tec gtc ata gaa gat gat ggt tta caa get gee ttc acc aca gee cat 1152 
Ser Val He Glu Asp Asp Gly Leu Gin Ala Ala Phe Thr Thr Ala His 
370 375 380 



1008 



1056 



1104 



1200 



1248 



ctt tec aac ctg gac cac age cag cct tgg tct cct tgc agt gee tac 



1296 



I 
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Leu Ser Asn Leu Asp His Ser Gin Pro Trp Ser Pro Cys Ser Ala Tyr 
420 425 430 

atg att aca tea ttt ctg gat aat ggt cat ggg gaa tgt ttg atg gac 1344 
Met lie Thr Ser Phe Leu Asp Asn Gly His Gly Glu Cys Leu Met Asp 

445 



435 



440 



aag cct cag aat ccc ata cag etc cca ggc gat etc cct ggc acc teg 
Lys Pro Gin Asn Pro He Gin Leu Pro Gly Asp Leu Pro Gly Thr Ser 
450 455 460 

tac gat gec aac egg cag tgc cag ttt aca ttt ggg gag gac tec aaa 
Tyr Asp Ala Asn Arg Gin Cys Gin Phe Thr Phe Gly Glu Asp Ser Lys 
465 470 475 480 

cac tgc cct gat gca gee age aca tgt age acc ttg tgg tgt acc ggc 
His Cys Pro Asp Ala Ala Ser Thr Cys Ser Thr Leu Trp Cys Thr Gly 
485 490 495 

acc tct ggt ggg gtg ctg gtg tgt caa acc aaa cac ttc ccg tgg gcg 
Thr Ser Gly Gly Val Leu Val Cys Gin Thr Lys Hxs Phe Pro Trp Ala 
500 " " 505 510 

gat ggc acc age tgt gga gaa ggg aaa tgg tgt ate aac ggc aag tgt 
Asp Gly Thr Ser Cys Gly Glu Gly Lys Trp Cys He Asn Gly Lys Cys 
515 520 525 

gtg aac aaa acc gac aga aag eat ttt gat aeg cct ttt cat gga age 
Val Asn Lys Thr Asp Arg Lys His Phe Asp Thr Pro Phe Hxs Gly Ser 
530 535 540 

tgg gga atg tgg ggg cct tgg gga gac tgt teg aga acg tgc ggt gga 
Trp Gly Met Trp Gly Pro Trp Gly Asp Cys Ser Arg Thr Cys Gly Gly 
545 550 555 560 

gga gtc cag tac acg atg agg gaa tgt gac aac cca gtc cca aag aat 
Gly Val Gin Tyr Thr Met Arg Glu Cys Asp Asn Pro Val Pro Lys Asn 
565 570 575 

gga ggg aag tac tgt gaa ggc aaa cga gtg cgc tac aga tec tgt aac 
Gly Gly Lys Tyr Cys Glu Gly Lys Arg Val Arg Tyr Arg Ser Cys Asn 
580 585 590 

ctt gag gac tgt cca gac aat aat gga aaa acc ttt aga gag gaa caa 
Leu Glu Asp Cys Pro Asp Asn Asn Gly Lys Thr Phe Arg Glu Glu Gin 
595 600 605 

tgt gaa gca cac aac gag ttt tea aaa get tec ttt ggg agt ggg cct 
Cys Glu Ala His Asn Glu Phe Ser Lys Ala Ser Phe Gly Ser Gly Pro 
610 615 620 

gcg gtg gaa tgg att ccc aag tac get ggc gtc tea cca aag gac agg 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 
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Ala Val Glu Trp He Pro Lys Tyr Ala Gly Val Ser Pro Lys Asp Arg 
625 630 635 640 



tgc aag etc ate tgc caa gec aaa ggc att ggc tac ttc ttc gtt ttg 
Cys Lys Leu lie Cys Gin Ala Lys Gly lie Gly Tyr Phe Phe Val Leu 
645 650 655 

cag ccc aag gtt gta gat ggt act cca tgt age cca gat tec acc tct 
Gin Pro Lys Val Val Asp Gly Thr Pro Cys Ser Pro Asp Ser Thr Ser 
660 665 670 



675 



690 



695 700 



tct act tgt aaa aaa ata tea gga tea gtt act agt gca aaa cct gga 

Ser Thr Cys Lys Lys He Ser Gly Ser Val Thr Ser Ala Lys Pro Gly 
705 " 710 715 720 

tat cat gat ate ate aca att cca act gga gee acc aac ate gaa gtg 

Tyr His Asp He He Thr He Pro Thr Gly Ala Thr Asn He Glu Val 
725 730 735 

aaa cag egg aac cag agg gga tec agg aac aat ggc age ttt ctt gee 
Lys Gin Arg Asn Gin Arg Gly Ser Arg Asn Asn Gly Ser Phe Leu Ala 



740 



745 750 



ate aaa get get gat ggc aca tat att ctt aat ggt gac tac act ttg 
He Lys Ala Ala Asp Gly Thr Tyr He Leu Asn Gly Asp Tyr Thr Leu 

760 765 



755 



790 795 800 



aaa gag ccc ttg acc ate cag gtt ctt act gtg ggc aat gec ctt cga 
Lys Glu Pro Leu Thr He Gin Val Leu Thr Val Gly Asn Ala Leu Arg 
805 810 815 

cct aaa att aaa tac acc tac ttc gta aag aag aag aag gaa tct ttc 
Pro Lys He Lys Tyr Thr Tyr Phe Val Lys Lys Lys Lys Glu Ser Phe 
820 825 830 



1968 



2016 



gtc tgt gtg caa gga cag tgt gta aaa get ggt tgt gat ege ate ata 2064 
Val Cys Val Gin Gly Gin Cys Val Lys Ala Gly Cys Asp Arg He He 

680 685 



gac tec aaa aag aag ttt gat aaa tgt ggt gtt tgc ggg gga aat gga 2112 
Asp Ser Lys Lys Lys Phe Asp Lys Cys Gly Val Cys Gly Gly Asn Gly 



2160 



2208 



2256 



2304 



2352 



tec acc tta gag caa gac att atg tac aaa ggt gtt gtc ttg agg tac 
Ser Thr Leu Glu Gin Asp He Met Tyr Lys Gly Val Val Leu Arg Tyr 
770 775 780 

age ggc tec tct gcg gca ttg gaa aga att cgc age ttt age cct etc 2400 
Ser Gly Ser Ser Ala Ala Leu Glu Arg He Arg Ser Phe Ser Pro Leu 

785 



2448 



2496 



aat get ate ccc act ttt tea gca tgg gtc att gaa gag tgg ggc gaa 



2544 



2640 



2688 
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Asn Ala He Pro Thr Phe Ser Ala Trp Val He Glu Glu Trp Gly Glu 
835 840 8"5 

tgt tct aag tea tgt gaa ttg ggt tgg cag aga aga ctg gta gaa tgc 2592 
Cys Ser Lys Ser Cys Glu Leu Gly Trp Gin Arg Arg Leu Val Glu Cys 
850 855 860 

cga gac att aat gga cag cct get tec gag tgt gca aag gaa gtg aag 
Ara Asp He Asn Gly Gin Pro Ala Ser Glu Cys Ala Lys Glu Val Lys 
86 5 870 875 880 

cca gec age acc aga cct tgt gca gac cat ccc tgc ccc cag tgg cag 
Pro Ala Ser Thr Arg Pro Cys Ala Asp His Pro Cys Pro Gin Trp Gin 
885 890 895 

ctg ggg gag tgg tea tea tgt tct aag acc tgt ggg aag ggt tac aaa 2736 
Leu Gly Gib Trp Ser Ser Cys Ser Lys Thr Cys Gly Lys Gly Tyr Lys 
900 905 910 

aaa aga age ttg aag tgt ctg tec cat gat gga ggg gtg tta tct cat 
Lys Arg Ser Leu Lys Cys Leu Ser His Asp Gly Gly Val Leu Ser His 
915 920 925 

gag age tgt gat cct tta aag aaa cct aaa cat ttc ata gac ttt tgc 
Glu Ser Cys Asp Pro Leu Lys Lys Pro Lys His Phe He Asp Phe Cys 
930 935 940 

aca atg gca gaa tgc agt taa gtggtttaag tggtgttagc tttgaggcaa 
Thr Met Ala Glu Cys Ser 



2784 



2832 



2883 



945 


950 










ggcaaagtga 


ggaagggctg 


gtgcagggaa 


agcaagaagg 


ctggagggat 


ccagcgtatc 


2943 


ttgccagtaa 


ccagtgaggt 


gtatcagtaa 


ggtgggatta 


tgggggtaga 


tagaaaagga 


3003 


gttgaatcat 


cagagtaaac 


tgccagttgc 


aaatttgata 


ggatagttag 


tgaggattat 


3063 


taacctctga 


gcagtgatat 


agcataataa 


anccccgggc 


attattatta 


ttatttcttt 


3123 


tgttacatct 


attacaagtt 


tagaaaaaac 


aaagcaattg 


tcaaaaaaaa 


aaaaaaaaaa 


3183 


aaaaaaaaaa 


aaagggegge 


cgctctagag 


gatccctcga 


ggggcccaag 


ettacgegtg 


3243 


catgntgtca 


tnagtctn 










3261 



<210> 2 
<211> 950 
<212> PRT 

<213> Homo sapiens 



<400> 2 
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Met Gly Asn Ala Glu Arg Ala Pro Gly Ser Arg Ser Phe Gly Pro Val 
! 5 10 

Pro Thr Leu Leu Leu Leu Ala Ala Ala Leu Leu Ala Val Ser Asp Ala 
20 25 30 



Leu Gly Arg Pro Ser Glu 
35 



Glu Asp Glu Glu Leu Val Val Pro Glu Leu 



40 45 



His Gly Thr Thr Arg Leu Arg Leu His Ala Phe 



55 60 



Glu Arg Ala Pro Gly His Gly 
50 5 5 

Asp Gin Gin Leu Asp Leu Glu Leu Arg Pro Asp Ser Ser Phe Leu Ala 
65 ™ 75 

Pro Gly Phe Thr Leu Gin Asn Val Gly Arg Lys Ser Gly Ser Glu Thr 
85 90 95 

Pro Leu Pro Glu Thr Asp Leu Ala His Cys Phe Tyr Ser Gly Thr Val 
100 105 HO 



Asn Gly Asp Pro Ser Ser Ala 
115 



Ala Ala Leu Ser Leu Cys Glu Gly Val 
120 125 



_ r Leu Leu Gly Glu Ala Tyr Phe lie Gin Pro Leu 

130 



Arg Gly Ala Phe Ty 

135 I 40 



Pro Ala Ala Ser Glu Arg Leu 
145 



Ala Thr Ala Ala Pro Gly Glu Lys Pro 
150 155 160 



Pro Ala Pro Leu Gin Phe His Leu Leu Arg Arg Asn Arg Gin Gly Asp 
165 110 175 

Val Gly Gly Thr Cys Gly Val Val Asp Asp Glu Pro Arg Pro Thr Gly 
180 185 I 90 

Lys Ala Glu Thr Glu Asp Glu Asp Glu Gly Thr Glu Gly Glu Asp Glu 
195 200 205 

Gly Pro Gin Trp Ser Pro Gin Asp Pro Ala Leu Gin Gly Val Gly Gin 



210 



215 220 



Pro Thr Gly Thr Gly Ser lie Arg Lys Lys Arg Phe Val Ser Ser His 

225 230 235 240 

Ara Tyr Val Glu Thr Met Leu Val Ala Asp Gin Ser Met Ala Glu Phe 

245 250 255 

His Gly Ser Gly Leu Lys His Tyr Leu Leu Thr Leu Phe Ser Val Ala 

260 265 270 

Ala Arg Leu Tyr Lys His Pro Ser He Arg Asn Ser Val Ser Leu Val 
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275 280 285 

Val Val Lys He Leu Val He His Asp Glu Gin Lys Gly Pro Glu Val 
290 295 300 

Thr Ser Asn Ala Ala Leu Thr Leu Arg Asn Phe Cys Asn Trp Gin Lys 
305 310 315 320 

Gin His Asn Pro Pro Ser Asp Arg Asp Ala Glu His Tyr Asp Thr Ala 
325 330 335 

He Leu Phe Thr Arg Gin Asp Leu Cys Gly Ser Gin Thr Cys Asp Thr 
340 345 350 

Leu Gly Met Ala Asp Val Gly Thr Val Cys Asp Pro Ser Arg Ser Cys 
355 360 365 

Ser Val He Glu Asp Asp Gly Leu Gin Ala Ala Phe Thr Thr Ala His 
370 375 380 

Glu Leu Gly His Val Phe Asn Met Pro His Asp Asp Ala Lys Gin Cys 
385 390 - ... .395 400 

Ala Ser Leu Asn Gly Val Asn Gin Asp Ser His Met Met Ala Ser Met 
405 410 415 

Leu Ser Asn Leu Asp His Ser Gin Pro Trp Ser Pro Cys Ser Ala Tyr 
420 425 430 

Met He Thr Ser Phe Leu Asp Asn Gly His Gly Glu Cys. Leu Met Asp 
435 440 445 

Lys Pro Gin Asn Pro He Gin Leu Pro Gly Asp Leu Pro Gly Thr Ser 
450 455 460 

Tyr Asp Ala Asn Arg Gin Cys Gin Phe Thr Phe Gly Glu Asp Ser Lys 
465 470 475 480 

His Cys Pro Asp Ala Ala Ser Thr Cys Ser Thr Leu Trp Cys Thr Gly 
485 490 495 

Thr Ser Gly Gly Val Leu Val Cys Gin Thr Lys His Phe Pro Trp Ala 
500 505 510 

Asp Gly Thr Ser Cys Gly Glu Gly Lys Trp Cys He Asn Gly Lys Cys 
515 520 525 

Val Asn Lys Thr Asp Arg Lys His Phe Asp Thr Pro Phe His Gly Ser 
530 535 540 

Trp Gly Met Trp Gly Pro Trp Gly Asp Cys Ser Arg Thr Cys Gly Gly 
545 550 555 560 
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Glv Val Gin Tyr Thr Met Arg Glu Cys Asp Asn Pro Val Pro Lys Asn 
565 570 575 

Gly Gly Lys Tyr Cys Glu Gly Lys Arg Val Arg Tyr Arg Ser Cys Asn 
580 585 590 



Leu Glu Asp Cys Pro Asp Asn Asn Gly Lys Thr Phe Arg Glu Glu Gin 
595 



600 605 



Cys Glu Ala His Asn Glu Phe Ser Lys Ala Ser Phe Gly Ser Gly Pro 
610 615 620 



Ala Val Glu Trp He Pro Lys Tyr Ala Gly Val Ser Pro Lys Asp Arg 
625 



630 635 640 



Cvs Lys Leu He Cys Gin Ala Lys Gly He Gly Tyr Phe Phe Val Leu 
645 650 655 

Gin Pro Lys Val Val Asp Gly Thr Pro Cys Ser Pro Asp Ser Thr Ser 
660 665 670 

Val Cys Val Gin Gly Gin Cys Val Lys Ala Gly Cys Asp Arg He He 
675 680 685 

Asp Ser Lys Lys Lys Phe Asp Lys Cys Gly Val Cys Gly Gly Asn Gly 
690 695 700 

Ser Thr Cys Lys Lys He Ser Gly Ser Val Thr Ser Ala Lys Pro Gly 
705 710 715 720 

Tyr His Asp He He Thr He Pro Thr Gly Ala Thr Asn lie Glu Val 
725 730 735 

Lys Gin Arg Asn Gin Arg Gly Ser Arg Asn Asn Gly Ser Phe Leu Ala 
740 745 750 

lie Lys Ala Ala Asp Gly Thr Tyr He Leu Asn Gly Asp Tyr Thr Leu 
755 7 60 765 

Ser Thr Leu Glu Gin Asp He Met Tyr Lys Gly Val Val Leu Arg Tyr 
770 775 780 



Ser Gly Ser Ser Ala Ala Leu Glu Arg He Arg Ser Phe Ser Pro Leu 
785 



7 90 7 95 800 



Lys Glu Pro Leu Thr He Gin Val Leu Thr Val Gly Asn Ala Leu Arg 
805 810 815 

Pro Lys He Lys Tyr Thr Tyr Phe Val Lys Lys Lys Lys Glu Ser Phe 
820 825 830 
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Asn Ala He Pro 
835 

Cys Ser Lys Ser 
850 

Arg Asp He Asn 
865 

Pro Ala Ser Thr 



Leu Gly Glu Trp 
900 

Lys Arg Ser Leu 
915 

Glu Ser Cys Asp 
930 

Thr- Met- Ala Glu 
945 



Thr Phe Ser Ala 
840 

Cys Glu Leu Gly 
855 

Gly Gin Pro Ala 
870 

Arg Pro Cys Ala 
885 

Ser Ser Cys Ser 



Lys Cys Leu Ser 
920 

Pro Leu Lys Lys 
935 

Cys Ser 
950 



Trp Val He Glu 



Trp Gin Arg Arg 
860 

Ser Glu Cys Ala 
875 

Asp His Pro Cys 
890 

Lys Thr Cys Gly 
905 

His Asp Gly Gly 



Pro Lys His Phe 
940 



Glu Trp Gly Glu 
845 

Leu Val Glu Cys 



Lys Glu Val Lys 
880 

Pro Gin Trp Gin 
895 

Lys Gly Tyr Lys 
910 

Val Leu Ser His 
925 

lie Asp Phe Cys 



<210> 3 

<211> 3008 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . - (2670) 

<220> 

<221> UNSURE. 
<222> (2887) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (2957) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (2970) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (2981). 

<223> May be any nucleic acid 
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<400> 3 

atg ttc ccc gcc ccc gcc gcc ccc egg tgg ctt ccg ttc ctg ctg ctg 
Met Phe Pro Ala Pro Ala Ala Pro Arg Trp Leu Pro Phe Leu Leu Leu 
1 5 10 15 

ctg ctg ctg ctg ctg ctg ccg ctg gcc cgc ggc gcc ccg gcc egg ccc 
Leu Leu Leu Leu Leu Leu Pro Leu Ala Arg Gly Ala Pro Ala Arg Pro 
20 25 30 

gca gcc ggg ggg cag gcc teg gag ctg gtg gtg ccc acg egg ttg ccc 
Ala Ala Gly Gly Gin Ala Ser Glu Leu Val Val Pro Thr Arg Leu Pro 
35 40 45 

ggc age gcg ggc gag etc gcg etc cac ctg tec gcc ttc ggc aag ggc 

Gly Ser Ala Gly Glu Leu Ala Leu His Leu Ser Ala Phe Gly Lys Gly 

50 55 60 

-\ 

ttc gtg ttg cgc ctg gcg ccc gac gac age ttc ctg gcg ccc gag ttc 

Phe Val Leu Arg Leu Ala Pro Asp Asp Ser Phe Leu Ala Pro Glu Phe 

65 10 75 80 



gaa gag gag gca gaa ggc get age gag ccg cca ccg ccc ctg ggg gcc 
Glu Glu Glu Ala Glu Gly Ala Ser Glu Pro Pro Pro Pro Leu Gly Ala 
195 200 205 



48 



96 



144 



192 



240 



aag ate gag cgc etc ggg ggc tec ggc egg gcg acc ggg ggc gag egg 288 
Lys He Glu Arg Leu Gly Gly Ser Gly Arg Ala Thr Gly Gly Glu Arg 
85 90 95 

ggg ctg cgc ggc tgt ttt ttt tec ggc acc gtg aat ggg gag ccc gag 336 
Gly Leu Arg Gly Cys Phe Phe Ser Gly Thr Val Asn Gly Glu Pro Glu 
100 105 HO 

teg ctg gcg gcg gtc age ctg tgc cgc ggg ctg age ggc tec ttc ctg 384 
Ser Leu Ala Ala Val Ser Leu Cys Arg Gly Leu Ser Gly Ser Phe Leu 
115 120 125 

ctg gac ggc gag gag ttc acc ate cag ccg cag ggc gcg ggg ggc tec 432 
Leu Asp Gly Glu Glu Phe Thr He Gin Pro Gin Gly Ala Gly Gly Ser 
130 135 140 



528 



ctg get cag ccg cac cgc ctg cag cgc tgg ggt ccc gcc gga gcc cgc 480 

Leu Ala Gin Pro His Arg Leu Gin Arg Trp Gly Pro Ala Gly Ala Arg 
145 150 155 160 

ccc etc ccg cga gga ccc gag tgg gag gtg gag acg gga gag ggt cag 

Pro Leu Pro Arg Gly Pro Glu Trp Glu Val Glu Thr Gly Glu Gly Gin 
165 170 175 

agg cag gag aga gga gac cac cag gag gac age gag gag gdg age caa 57 6 

Arg Gin Glu Arg Gly Asp His Gin Glu Asp Ser Glu Glu Glu Ser Gin 
180 185 190 



624 
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acg agt agg acc aag egg ttt gtg tct gag gcg cgc ttc gtg gag acg 672 
Thr Ser Arg Thr Lys Arg Phe Val Ser Glu Ala Arg Phe Val Glu Thr 
210 215 220 



ctg ctg gtg gec gat gcg tec atg get gec ttc tac ggg gee gac ctg 
Leu Leu Val Ala Asp Ala Ser Met Ala Ala Phe Tyr Gly Ala Asp Leu 

230 235 240 



225 



ate gta gaa gat gaa aaa tgg ggc oca gag gtg tec gac aat ggg ggg 
He Val Glu Asp Glu Lys Trp Gly Pro Glu Val Ser Asp Asn Gly Gly 
275 280 285 



cag aac ttc tgt ggg cag gag ggg ctg tgt gac acc ctg ggt gtg gca 
Gin Asn Phe Cys Gly Gin Glu Gly Leu Cys Asp Thr Leu Gly Val Ala 
325 330 335 

gac ate ggg acc att tgt gac ccc aac aaa age tgc 'tec gtg ate gag 
Asp He Gly Thr He Cys Asp Pro Asn Lys Ser Cys Ser Val He Glu 
340 345 350 

gat gag ggg etc cag gcg gec cac acc ctg gee cat gaa eta ggg cac 
Asp Glu Gly Leu Gin Ala Ala His Thr Leu Ala His Glu Leu Gly His 
355 360 365 

gtc etc age atg ccc cac gac gac tec aag ccc tgc aca egg etc ttc 
Val Leu Ser Met Pro His Asp Asp Ser Lys Pro Cys Thr Arg Leu Phe 
370 375 380 

ggg ccc atg ggc aag cac cac gtg atg gca ccg ctg ttc gtc cac ctg 
Gly Pro Met Gly Lys His His Val Met Ala Pro Leu Phe Val His Leu 
385 390 395 400 

aac cag acg ctg ccc tgg tec ccc tgc age gec atg tat etc aca gag 
Asn Gin Thr Leu Pro Trp Ser Pro Cys Ser Ala Met Tyr Leu Thr Glu 
405 410 415 



720 



768 



cag aac cac ate ctg acg tta atg tct gtg gca gee cga ate tac aag 

Gin Asn His He Leu Thr Leu Met Ser Val Ala Ala Arg He Tyr Lys 
245 250 255 

cac ccc age ate aag aat tec ate aac ctg atg gtg gta aaa gtg ctg 816 

His Pro Ser He Lys Asn Ser He Asn Leu Met Val Val Lys Val Leu 
260 '265 270 



864 



ctt aca ctg cgt aac ttc tgc aac tgg cag egg cgt ttc aac cag ccc 912 

Leu Thr Leu Arg. Asn Phe . Cys Asn Trp Gin Arg. Arg Phe Asn Gin Pro 
290 295 300 

age gac cgc cac cca gag cac tac gac acg gee ate ctg etc acc aga 960 

Ser Asp Arg His Pro Glu His Tyr Asp Thr Ala He Leu Leu Thr Arg 
305 310 315 320 



1008 



1056 



1104 



1152 



1200 



1248 
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ctt ctg gac ggc ggg cac gga gac tgt etc ctg gat gec cct ggt gcg 1296 
Leu Leu Asp Gly Gly His Gly Asp Cys Leu Leu Asp Ala Pro Gly Ala 
420 425 430 



gec ctg ccc etc ccc aca ggc etc ccg ggc cgc atg gec ctg tac cag 

Ala Leu Pro Leu Pro Thr Gly Leu Pro Gly Arg Met Ala Leu Tyr Gin 

435 440 445 

ctg gac cag cag tgc agg cag ate ttt ggg ccg gat ttc cgc cac tgc 

Leu Asp Gin Gin Cys Arg Gin lie Phe Gly Pro Asp Phe Arg His Cys 

450 455 460 

ccc aac acc tct get cag gac gtc tgc gec cag ctt tgg tgc cac act 

Pro Asn Thr Ser Ala Gin Asp Val Cys Ala Gin Leu Trp Cys His Thr 

465 470 475 480 



tgt gag aag tat aat gee tac aat tac act gac atg gac ggg aat etc 

Cys Glu Lys Tyr Asn Ala Tyr Asn Tyr Thr Asp Met Asp Gly Asn Leu 
595 600 605 

ctg cag tgg gtc ccc aag tat get ggg gtg tec ccc egg gac cgc tgc 

Leu Gin Trp Val Pro Lys Tyr Ala Gly Val Ser Pro Arg Asp Arg Cys 

610 615 620 



1344 



1392 



1440 



1488 



gat ggg get gag ccc ctg tgc cac acg aag aat ggc age ctg ccc tgg 
Asp Gly Ala Glu Pro Leu Cys His Thr Lys Asn Gly Ser Leu Pro Trp 
485 490 495 

get gac ggc acg ccg tgc ggg- cct ggg cac etc tgc tea gaa ggc age 1536 
Ala Asp Gly" Thr Pro Cys Gly Pro Gly His Leu Cys Ser Glu Gly Ser 
500 505 510 

tgt eta cct gag gag gaa gtg gag agg ccc aag ccc gtg gta gat gga 1584 
Cys Leu Pro Glu Glu Glu Val Glu Arg Pro Lys Pro Val Val Asp Gly 
515 520 525 

ggc tgg gca ccg tgg gga ccc tgg gga gaa tgt tct egg acc tgt gga 1632 
Gly Trp Ala Pro Trp Gly Pro Trp Gly Glu Cys Ser Arg Thr Cys Gly 
530 535 540 

gga gga gta cag ttt tea cac cgt gag tgc aag gac ccc gag cct cag 1680 
Gly Gly Val Gin Phe Ser His Arg Glu Cys Lys Asp Pro Glu Pro Gin 
545 550 555 560 

aat gga gga aga tac tgc ctg ggt egg aga gee aag tac cag tea tgc 1728 
Asn Gly Gly Arg Tyr Cys Leu Gly Arg Arg Ala Lys Tyr Gin Ser Cys 
565 570 575 

cac acg gag gaa tgc ccc cct gac ggg aaa age ttc agg gag cag cag 177 6 
His Thr Glu Glu Cys Pro Pro Asp Gly Lys Ser Phe Arg Glu Gin Gin 
580 585 590 



1824 



1872 
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aag ttg ttc 
Lys Leu Phe 
625 

gcc aag gtg 
Ala Lys Val 



tgt gtc cgt 
Cys Val Arg 



teg cct egg 
Ser Pro Arg 
67 5 

tec tgc agg 
Ser Cys Arg 
690 

aat gac att 
Asn Asp Jlie 
705 

cag egg age 
Gin Arg Ser 



aag acg get 
Lys Thr Ala 



gcc ata gag 
Ala lie Glu 
755 

ggc tec ate 
Gly Ser He 
770 

gag cct ctg 
Glu Pro Leu 
785 

cca aaa gtc 
Pro Lys Val 



atg cag age 
Met Gin Ser 



tgc cga gcc 
Cys Arg Ala 
630 

att gat ggc 
He Asp Gly 
645 

ggc cag tgt 
Gly Gin Cys 
660 

aag ctg gac 
Lys Leu Asp 



aag gtc tec 
Lys Val Ser 



gtc acc ate 
Val /Thr He 
710 

cac ccg ggt 
His Pro Gly 
725 

gat ggg cag 
Asp Gly Gin 
740 

cag gac ate 
Gin Asp He 



gcc acc ctg 
Ala Thr Leu 



aca gtg cag 
Thr Val Gin 
790 

aaa tac acc 
Lys Tyr Thr 
805 

age aaa gag 
Ser Lys Glu 
820 



egg ggg agg 
Arg Gly Arg 



acc ctg tgt 
Thr Leu Cys 



gtc aag gcc 
Val Lys Ala 
665 

aaa tgc ggg 
Lys Cys Gly 
680 

ggg tec etc 
Gly Ser Leu 
695 

cca get ggt 
Pro Ala Gly 



gtg cag aac 
Val Gin Asn 



tac ctg etc 
Tyr Leu Leu 
745 

ttg gtg aag 
Leu Val Lys 
7 60 

gag cgc ctg 
Glu Arg Leu 
775 

etc ctg aca 
Leu Leu Thr 



ttc ttt gtt 
Phe Phe Val 



aga gca acc 
Arg Ala Thr 
825 



age gag ttc 
Ser Glu Phe 
635 

ggg cca gaa 
Gly Pro Glu 
650 

ggc tgt gac 
Gly Cys Asp 



gtg tgt ggg 
Val Cys Gly 



acc ccc acc 
Thr Pro Thr 
700 

gcc act aat 
Ala Thr Asn 
" 715 

gat ggg aac 
Asp Gly Asn 
730 

aac ggc aac 
Asn Gly Asn 



ggg acc ate 
Gly Thr He 



cag age ttc 
Gin Ser Phe 
780 

gtc cct ggc 
Val Pro Gly 
795 

cct aat gac 
Pro Asn Asp 
810 

acc aac ate 
Thr Asn He 



aaa gtg ttc 
Lys Val Phe 



aca ctg gcc 
Thr Leu Ala 
655 

cat gtg gtg 
His Val Val 
670 

ggc aaa ggc 
Gly Lys Gly 
685 

aat tat ggc 
Asn Tyr Gly 



att gac gtg 
He Asp Val 



tac ctg gcg 
Tyr Leu Ala 
735 

ctg gcc ate 
Leu Ala lie 
750 

ctg aag tac 
Leu Lys Tyr 
765 

egg ccc ttg 
Arg Pro Leu 



gag gtc ttc 
Glu Val Phe 



gtg gac ttt 
Val Asp Phe 
815 

ate cag ccg 
lie Gin Pro 
830 



gag 1920 

Glu 

640 

ate 1968 
He 



gac 2016 
Asp 



aac 2064 
Asn 



tac 2112 
Tyr 



aag 2160 

Lys 

720 

ctg 2208 
Leu 



tct 2256 
Ser 



age 2304 
Ser 



cca 2352 
Pro 



ccc 2400 

Pro 

800 

age 2448 
Ser 



ctg 2496 
Leu 
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etc cac gca cag tgg gtg ctg ggg gac tgg tct gag tgc tct age acc 2544 
Leu His Ala Gin Trp Val Leu Gly Asp Trp Ser Glu Cys Ser Ser Thr 
835 840 845 

tgc ggg gec ggc tgg cag agg cga act gta gag tgc agg gac ccc tec 2592 
Cys Gly Ala Gly Trp Gin Arg Arg Thr Val Glu Cys Arg Asp Pro Ser 
850 855 860 

ggc cag gec tct gec acc tgc aac aag get ctg aaa ccc gag gat gee 2640 
Gly Gin Ala Ser Ala Thr Cys Asn Lys Ala Leu Lys Pro Glu Asp Ala 
865 870 875 880 

aag ccc tgc gaa age cag ctg tgc ccc ctg tgattcaggg gggcaggggc 2690 
Lys Pro Cys Glu Ser Gin Leu Cys Pro Leu 
885 890 

cagtcttgtg ctcctggaca tgcggtactg aggtgeagae aaggtctcca ctgtggtgac 2750 

tgggtccctt ggecatatea aggcagcacg gcccacccag gcctcccatt gccgcaaccc 2810 

ctccagtact gcacaaattc ctaaggggga agagaaaagg tatggggegg caaaacctat 2870 

catcaactgt ccawtgnaat ggaacttget cgggttcaat taaaggcata agttaaagta 2930 

aattcattat gatcaacaga cctcacntca tctgttgcan gatacaacta ntaaaaaaaa 2990 

aaaaaaaaaa aaaaaaaa 3008 



<210> 4 
<211> 890 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Phe Pro Ala Pro Ala Ala Pro Arg Trp Leu Pro Phe Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Pro Leu Ala Arg Gly Ala Pro Ala Arg Pro 
20 25 30 

Ala Ala Gly Gly Gin Ala Ser Glu Leu Val Val Pro Thr Arg Leu Pro 
35 40 45 

Gly Ser Ala Gly Glu Leu Ala Leu His Leu Ser Ala Phe Gly Lys Gly 
50 55 60 

Phe Val Leu Arg Leu Ala Pro Asp Asp Ser Phe Leu Ala Pro Glu Phe 
65 70 75 80 

Lys lie Glu Arg Leu Gly Gly Ser Gly Arg Ala Thr Gly Gly Glu Arg 
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85 90 95 

Gly Leu Arg Gly Cys Phe Phe Ser Gly Thr Val Asn Gly Glu Pro Glu 
100 105 110 

Ser Leu Ala Ala Val Ser Leu Cys Arg Gly Leu Ser Gly Ser Phe Leu 
115 120 125 

Leu Asp Gly Glu Glu Phe Thr lie Gin Pro Gin Gly Ala Gly Gly Ser 
130 135 140 

Leu Ala Gin Pro His Arg Leu Gin Arg Trp Gly Pro Ala Gly Ala Arg 
145 150 155 160 

Pro Leu Pro Arg Gly Pro Glu Trp Glu Val Glu Thr Gly Glu Gly Gin 
165 170 175 

Arg Gin Glu Arg Gly Asp His Gin Glu Asp Ser Glu Glu Glu Ser Gin 
180 185 190 

Glu Glu Glu Ala Glu Gly Ala Ser Glu Pro Pro Pro Pro Leu Gly Ala 
195 200 205 

Thr Ser Arg Thr Lys Arg Phe Val Ser Glu Ala Arg Phe Val Glu Thr 
210 215 220 

Leu Leu Val Ala Asp Ala Ser Met Ala Ala Phe Tyr Gly Ala Asp Leu 
225 230 235 240 

Gin Asn His He Leu Thr Leu Met Ser Val Ala Ala Arg He Tyr Lys 
245 250 255 

His Pro Ser He Lys Asn Ser He Asn Leu Met Val Val Lys Val Leu 
260 265 270 

He Val Glu Asp Glu Lys Trp Gly Pro Glu Val Ser Asp Asn Gly Gly 
275 280 285 

Leu Thr Leu Arg Asn Phe Cys Asn Trp Gin Arg Arg Phe Asn Gin Pro 
290 295 300 

Ser Asp Arg His Pro Glu His Tyr "Asp Thr Ala lie Leu Leu Thr Arg 
305 310 315 320 

Gin Asn Phe Cys Gly Gin Glu Gly Leu Cys Asp Thr Leu Gly Val Ala 
325 330 335 

Asp lie Gly Thr He Cys Asp Pro Asn Lys Ser Cys Ser Val He Glu 
340 345 350 



Asp Glu Gly Leu Gin Ala Ala His Thr Leu Ala His Glu Leu Gly His 
355 360 365 
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Val Leu Ser Met Pro 
370 

Gly Pro Met Gly Lys 
385 

Asn Gin Thr Leu Pro 
405 

Leu Leu Asp Gly Gly 
420 

Ala Leu Pro Leu Pro 
435 

Leu Asp Gin Gin Cys 
450 - 

Pro Asn Thr Ser Ala 
465 

Asp Gly Ala Glu Pro 
485 

Ala Asp Gly Thr Pro 
500 

Cys Leu Pro Glu Glu 
515 

Gly Trp Ala Pro Trp 
530 

Gly Gly Val Gin Phe 
545 

Asn Gly Gly Arg Tyr 
565 

His Thr Glu Glu Cys 
580 

Cys Glu Lys Tyr Asn 
595 

Leu Gin Trp Val Pro 
610 
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His Asp Asp Ser Lys Pro 
375 

His His Val Met Ala Pro 
390 395 

Trp Ser Pro Cys Ser Ala 
410 

His Gly Asp Cys Leu Leu 
425 

Thr Gly Leu Pro Gly Arg 
440 

Arg Gin lie Phe Gly Pro 
455 

Gin Asp Val Cys Ala Gin 
470 475 

Leu "Cys His Thr Lys Asn 
4 90 

Cys Gly Pro Gly His Leu 
505 

Glu Val Glu Arg Pro Lys 
520 

Gly Pro Trp Gly Glu Cys 
535 

Ser His Arg Glu Cys Lys 
550 555 

Cys Leu Gly Arg Arg Ala 
570 

Pro Pro Asp Gly Lys Ser 
585 

Ala Tyr Asn Tyr Thr Asp 
600 

Lys Tyr Ala Gly Val Ser 
615 



Cys Thr Arg Leu Phe 
380 

Leu Phe Val His Leu 
400 

Met Tyr Leu Thr Glu 
415 

Asp Ala Pro Gly Ala 
430 

Met Ala Leu Tyr Gin 
445 

Asp Phe Arg His Cys 
4 60 

Leu Trp Cys His Thr 
480 

Gly Ser Leu Pro Trp 
495 

Cys Ser Glu Gly Ser 
510 

Pro Val Val Asp Gly 
525 

Ser Arg Thr Cys Gly 
540 

Asp Pro Glu Pro Gin 
560 

Lys Tyr Gin Ser Cys 
575 

Phe Arg Glu Gin Gin 
590 

Met Asp Gly Asn Leu 
605 

Pro Arg Asp Arg Cys 
620 



Lys Leu Phe Cys Arg Ala Arg Gly Arg Ser Glu Phe Lys Val Phe Glu 
625 630 635 640 
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Ala Lys Val lie Asp Gly Thr Leu Cys Gly Pro Glu Thr Leu Ala lie 
645 650 655 

Cys Val Arg Gly Gin Cys Val Lys Ala Gly Cys Asp His Val Val Asp 
660 665 670 

Ser Pro Arg Lys Leu Asp Lys Cys Gly Val Cys Gly Gly Lys Gly Asn 
675 680 685 

Ser Cys Arg Lys Val Ser Gly Ser Leu Thr Pro Thr Asn Tyr Gly Tyr 
690 695 700 

Asn Asp lie Val Thr lie Pro Ala Gly Ala Thr Asn lie Asp Val Lys 
705 710 715 720 

Gin Arg Ser His Pro Gly Val Gin Asn Asp Gly Asn Tyr Leu Ala Leu 
725 730 735 

Lys Thr Ala Asp Gly Gin Tyr Leu Leu Asn Gly Asn Leu Ala lie Ser 
740 745 750 

Ala He Glu Gin Asp He Leu Val Lys Gly Thr He Leu Lys Tyr Ser 

755 _ 760 " 765 " 

Gly Ser lie Ala Thr Leu Glu Arg Leu Gin Ser Phe Arg Pro Leu Pro 
770 775 780 

Glu Pro Leu Thr Val Gin Leu Leu Thr Val Pro Gly Glu Val Phe Pro 
785 790 795 800 

Pro Lys Val Lys Tyr Thr Phe Phe Val Pro Asn Asp Val Asp Phe Ser 
805 810 815 

Met Gin Ser Ser Lys Glu Arg Ala Thr Thr Asn He He Gin Pro Leu 
820 825 830 

Leu His Ala Gin Trp Val Leu Gly Asp Trp Ser Glu Cys Ser Ser Thr 
835 840 845 

Cys Gly Ala Gly Trp Gin Arg Arg Thr Val Glu Cys Arg Asp Pro Ser 
850 855 860 

Gly Gin Ala Ser Ala Thr Cys Asn Lys Ala Leu Lys Pro Glu Asp Ala 
865 870 875 880 

Lys Pro Cys Glu Ser Gin Leu Cys Pro Leu * 
885 890 



<210> 5 
<211> 1203 
<212> PRT 
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<213> Bovine 
<400> 5 

Met Asp Pro Pro 
1 

Leu Leu Leu Leu 
20 

Ala Ala Ala Asp 
35 

lie Leu Ala Val 
50 

His Val Val Ser 
65 

Ala Ala Pro Ala 



Pro Gly Gly Arg 
100 

His Leu Arg Leu 
115 

Val Glu Trp Gin 
130 

Gly Thr Cys Leu 
145 

Ser Val Ala Leu 



Glu Glu Glu Glu 
180 

Lys Glu Ala Glu 
195 

Thr Ser Arg Pro 
210 

Ala Asp Ser Leu 
225 

Arg Val Asn Ser 



Ala Gly Ala Ala 
5 

Leu Leu Pro Leu 



Pro Pro Gly Gly 
40 

Pro Val Arg Thr 
55 

Ala Ala Thr Ala 
70 

Gin lie Pro Gly 
85 

-Leu Phe Tyr Asn 



Arg Pro Asn Ala 
120 

Gly Glu Ser Gly 
135 

Tyr Val Gly Asp 
150 

Ser Asn Cys Asp 
165 

Phe Phe lie Glu 



Gin Gly Arg Val 
200 

Pro Pro Leu Gly 
215 

Asp Ser Leu Ser 
230 

Ser Arg Arg Arg 
245 
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Gly Arg Leu Leu 
10 

Pro Ala Asp Ala 
25 

Pro Gin Gly His 



Asp Ala Gin Gly 
60 

Pro Ala Gly Val 
75 

Leu Ser Gly Gly 
90 

Val Thr Val Phe 
105 

Arg Leu Val Ala 



Ala Thr Arg Val 
140 

Val Ala Gly Leu 
155 

Gly Leu Ala Gly 
170 

Pro Leu Glu Lys 
185 

His Val Val Tyr 



Gin Ala Leu Asp 
220 

Arg Ala Leu Gly 
235 

Met Arg Arg His 
250 



Cys Pro Ala Leu 
15 

Arg Leu Ala Ala 
30 

Gly Ala Glu Arg 
45 

Arg Leu Val Ser 



Arg Thr Arg Arg 
80 

Ser Glu Glu Asp 
95 

Gly Arg Asp Leu 
110- 

Pro Gly Ala Thr 
125 

Glu Pro Leu Leu 



Ala Glu Ser Ser 
160 

Leu lie Arg Met 
175 

Gly Leu Ala Ala 
190 

His Arg Pro Thr 
205 

Thr Gly lie Ser 



Val Leu Glu Glu 
240 

Ala Ala Asp Asp 
255 



Asp Tyr Asn lie Glu Val Leu Leu Gly Val Asp Asp Ser Val Val Gin 
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260 265 270 

Phe His Gly Thr Glu His Val Gin Lys Tyr Leu Leu Thr Leu Met Asn 
275 280 285 

lie Val Asn Glu lie Tyr His Asp Glu Ser Leu Gly Ala His lie Asn 
290 295 300 

Val Val Leu Val Arg lie He Leu Leu Ser Tyr Gly Lys Ser Met Ser 
305 310 315 320 

Leu He Glu He Gly Asn Pro Ser Gin Ser Leu Glu Asn Val Cys Arg 
325 330 335 

Trp Ala Tyr Leu Gin Gin Lys Pro Asp Thr Asp His Asp Glu Tyr His 
340 345 350 

Asp His Ala He Phe Leu Thr Arg Gin Asp Phe Gly Pro Ser Gly Met 
355 360 365 

Gin Gly Tyr Ala Pro Val Thr Gly Met Cys His Pro Val Arg Ser Cys 
370 375 380 

Thr Leu Asn His Glu Asp Gly Phe Ser Ser Ala Phe Val Val Ala His 
385 390 395 400 

Glu Thr Gly His Val Leu Gly Met Glu His Asp Gly Gin Gly Asn Arg 
405 410 415 

Cys Gly Asp Glu Val Arg Leu Gly Ser lie Met Ala Pro Leu Val Gin 
420 425 430 

Ala Ala Phe His Arg Phe His Trp Ser Arg Cys Ser Gin' Gin Glu Leu 
435 440 445 

Ser Arg Tyr Leu His Ser Tyr Asp Cys Leu Arg Asp Asp Pro Phe Thr 
450 455 460 

His Asp Trp Pro Ala Leu Pro Gin Leu Pro Gly Leu His Tyr Ser Met 
465 470 475 480 

Asn Glu Gin Cys Arg Phe Asp Phe Gly Leu Gly Tyr Met Met Cys Thr 
485 490 495 

Ala Phe Arg Thr Phe Asp Pro Cys Lys Gin Leu Trp Cys Ser His Pro 
500 505 510 

Asp Asn Pro Tyr Phe Cys Lys Thr Lys Lys Gly Pro Pro Leu Asp Gly 
515 520 525 



Thr Met Cys Ala Pro Gly Lys His Cys Phe Lys Gly His Cys He Trp 
530 535 540 
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Leu Thr Pro Asp lie Leu Lys Arg Asp Gly Asn Trp Gly Ala Trp Ser 
545 550 555 560 

Pro Phe Gly Ser Cys Ser Arg Thr Cys Gly Thr Gly Val Lys Phe Arg 
565 570 575 

Thr Arg Gin Cys Asp Asn Pro His Pro Ala Asn Gly Gly Arg Thr Cys 
580 585 590 

Ser Gly Leu Ala Tyr Asp Phe Gin Leu Cys Asn Ser Gin Asp Cys Pro 
595 600 605 

Asp Ala Leu Ala Asp Phe Arg Glu Glu Gin Cys Arg Gin Trp Asp Leu 
610 615 620 

Tyr Phe Glu His Gly Asp Ala Gin His His Trp Leu Pro His Glu His 
625 630 635 640 

Arg Asp Ala Lys Glu Arg Cys His Leu Tyr Cys Glu Ser Lys Glu Thr 
645 650 655 

Gly Glu Val Val Ser Met Lys Arg Met Val His Asp Gly Thr Arg Cys 
660 665 670 

Ser Tyr Lys Asp Ala Phe Ser Leu Cys Val Arg Gly Asp Cys Arg Lys 
675 680 685 

Val Gly Cys Asp Gly Val lie Gly Ser Ser Lys Gin Glu Asp Lys Cys 
690 695 700 

Gly Val Cys Gly Gly Asp Asn Ser His Cys Lys Val Val Lys Gly Thr 
705 3 710 715 720 

Phe Ser Arg Ser Pro Lys Lys Leu Gly Tyr lie Lys Met Phe Glu lie 
725 730 735 

Pro Ala Gly Ala Arg His Leu Leu lie Gin Glu Ala Asp Thr Thr Ser 
740 745 750 

His His Leu Ala Val Lys Asn Leu Glu Thr Gly Lys Phe lie Leu Asn 
755 760 765 

Glu Glu Asn Asp Val Asp Pro Asn Ser Lys Thr Phe lie Ala Met Gly 
770 775 780 

Val Glu Trp Glu Tyr Arg Asp Glu Asp Gly Arg Glu Thr Leu Gin Thr 
785 "790 795 800 



Met Gly Pro Leu His Gly Thr lie Thr Val Leu Val lie Pro Glu Gly 
805 810 815 
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Asp Ala Arg lie Ser Leu Thr Tyr Lys Tyr Met lie His Glu Asp Ser 
820 825 830 

Leu Asn Val Asp Asp Asn Asn Val Leu Glu Asp Asp Ser Val Gly Tyr 
835 840 845 

Glu Trp Ala Leu Lys Lys Trp Ser Pro Cys Ser Lys Pro Cys Gly Gly 
850 855 860 

Gly Ser Gin Phe Thr Lys Tyr Gly Cys Arg Arg Arg Leu Asp His Lys 
865 870 875 880 

Met Val His Arg Gly Phe Cys Asp Ser Val Ser Lys Pro Lys Ala lie 
885 890 895 

Arg Arg Thr Cys Asn Pro Gin Glu Cys Ser Gin Pro Val Trp Val Thr 
900 905 910 

Gly Glu Trp Glu Pro Cys Ser Arg Ser Cys Gly Arg Thr Gly Met Gin 
915 920 925 

Val Arg Ser Val Arg Cys Val Gin Pro Leu His Asn Asn Thr Thr Arg 

. - 930 . 935 940 . . . 

Ser Val His Thr Lys His Cys Asn Asp Ala Arg Pro Glu Gly Arg Arg 
945 950 955 960 

Ala Cys Asn Arg Glu Leu Cys Pro Gly Arg Trp Arg Ala Gly Ser Trp 
965 970 975 

Ser Gin Cys Ser Val Thr Cys Gly Asn Gly Thr Gin Glu Arg Pro Val 
980 985 990 

Leu Cys Arg Thr Ala Asp Asp Ser Phe Gly Val Cys Arg Glu Glu Arg 
995 1000 1005 

Pro Glu Thr Ala Arg He Cys Arg Leu Gly Pro Cys Pro Arg Asn Thr 
1010 1015 1020 

Ser Asp Pro Ser Lys Lys Ser Tyr Val Val Gin Trp Leu Ser Arg Pro 
1025 1030 1035 1040 

Asp Pro Asn Ser Pro Val Gin Glu Thr Ser Ser Lys Gly Arg Cys Gin 
1045 1050 1055 

Gly Asp Lys Ser Val Phe Cys Arg Met Glu Val Leu Ser Arg Tyr Cys 
1060 1065 1070 

Ser He Pro Gly Tyr Asn Lys Leu Cys Cys Lys Ser Cys Asn Pro His 
1075 1080 1085 



Asp Asn Leu Thr Asp Val Asp Asp Arg Ala Glu Pro Pro Ser Gly Lys 
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1090 ' 1095 1100 

His Asn Asp lie Glu Glu Leu Met Pro Thr Leu Ser Val Pro Thr Leu 
1105 1110 1115 1120 

Val Met Glu Val Gin Pro Pro Pro Gly lie Pro Leu Glu Val « Pro Leu 
1125 1130 1135 

Asn Thr Ser Ser Thr Asn Ala Thr Glu Asp His Pro Glu Thr Asn Ala 
1140 1145 1150 

Val Asp Val Pro Tyr Lys lie Pro Gly Leu Glu Asp Glu Val Gin Pro 
1155 1160 1165 

Pro Asn Leu lie Pro Arg Arg Pro Ser Pro Tyr Glu Lys Thr Arg Asn 
1170 1175 1180 

Gin Arg lie Gin Glu Leu lie Asp Glu Met Arg Lys Lys Glu Met Leu 
1185 1190 1195 1200 

Gly Lys Phe 



<210> 6 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Asp Asp Gly Trp Ser Pro Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser 
15 10 15 

Cys Gly Asn Gly lie Gin Gin Arg Gly Arg Ser Cys Asp Ser Leu Asn 
20 25 30 

Asn Arg Cys Glu Gly Ser Ser Val Gin Thr Arg Thr Cys His lie Gin 
35 40 45 

Glu Cys 
50 



<210> 7 

<211> 57 

<212> PRT 

<213> Homo sapiens 



<400> 7 

Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser Val Thr 
15 10 15 
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Cys Gly Asp Gly Val He Thr Arg He Arg Leu Cys Asn Ser Pro Ser 

20 25 30 

Pro Gin Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu Thr Lys 

35 40 45 

Ala Cys Lys Lys Asp Ala Cys Pro He 
50 55 



<210> 8 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Asn Gly Gly Trp Gly Pro Trp Ser Pro Trp Asp He Cys Ser Val Thr 
15 10 15 

Cys Gly Gly Gly Val Gin Lys Arg Ser Arg Leu Cys Asn Asn Pro Thr 
20 25 30 

Pro Gin Phe Gly Gly Lys Asp Cys Val Gly Asp Val Thr Glu Asn Gin 
35 40 " 45 

He Cys Asn Lys Gin Asp Cys Pro He 
50 55 



<210> 9 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Glu Glu Gly Trp Ser Pro Trp Ala Glu Trp Thr Gin Cys Ser Val Thr 
1 5 10 15 

Cys Gly Ser Gly Thr Gin Gin Arg Gly Arg Ser Cys Asp Val Thr Ser 
20 25 30 

Asn Thr Cys Leu Gly Pro Ser He Gin Thr Arg Ala Cys Ser Leu Ser 
35 40 45 

Lys Cys 
50 



<210> 10 
<211> 57 
<212> PRT 

<213> Homo sapiens 
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<400> 10 

Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser Val Thr 
15 10 15 

Cys Gly Val Gly Asn lie Thr Arg He Arg Leu Cys Asn Ser Pro Val 
20 25 30 

Pro Gin Met Gly Gly Lys Asn Cys Lys Gly Ser Gly Arg Glu Thr Lys 
35 ■ 40 45 

Ala Cys Gin Gly Ala Pro Cys Pro He 
50 55 



<210> 11 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Asp Gly Arg Trp Ser Pro Trp Ser Pro Trp Ser Ala Cys Thr Val Thr 
1 5 10 15 

Cys Ala Gly Gly He Arg Glu Arg Thr Arg Val Cys Asn Ser Pro Glu 
20 25 30 

Pro Gin Tyr Gly Gly Lys Ala Cys Val Gly Asp Val Gin Glu Arg Gin 
35 40 45 

Met Cys Asn Lys Arg Ser Cys Pro 
50 55 



<210> 12 

<211> 3974 

<212> DNA 

<213> Homo sapiens 

<400> 12 

ggtacctaag tgagtagggc gtccgatcga cggacgcctt ttttttgaat tcgtaatcat 60 

ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 120 

ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 180 

cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 24 0 

tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca 300 

ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 360 
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taatacggtt atccacagaa tcaggggata 
agcaaaaggc caggaaccgt aaaaaggccg 
cccctgacga gcatcacaaa aatcgacgct 
tataaagata ccaggcgttt ccccctggaa 
tgccgcttac cggatacctg tccgcctttc 
gctcacgctg taggtatctc agttcggtgt 
acgaaccccc cgttcagccc gaccgctgcg 
acccggtaag acacgactta tcgccactgg 
cgaggtatgt aggcggtgct acagagttct 
gaagaacagt atttggtatc tgcgctctgc 
gtagctcttg atccggcaaa caaaccaccg 
agcagattac gcgcagaaaa aaaggatctc 
ctgacgctca gtggaacgaa aactcacgtt 
caattcgcgc gcgaaggcga agcggcatgc 
acctttcgcg gtatggcatg atagcgcccg 
aaaccagtaa cgttatacga tgtcgcagag 
cgcgtggtga accaggccag ccacgtttct 
atggcggagc tgaattacat tcccaaccgc 
ttgctgattg gcgttgccac ctccagtctg 
gcgattaaat ctcgcgccga tcaactgggt 
agcggcgtcg aagcctgtaa agcggcggtg 
ctgatcatta actatccgct ggatgaccag 
aatgttccgg cgttatttct tgatgtctct 
tcccatgaag acggtacgcg actgggcgtg 
atcgcgctgt tagcgggccc attaagttct 
cataaatatc tcactcgcaa tcaaattcag 
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acgcaggaaa gaacatgtga gcaaaaggcc 420 
cgttgctggc gtttttccat aggctccgcc 480 
caagtcagag gtggcgaaac ccgacaggac 54 0 
gctccctcgt gcgctctcct gttccgaccc 600 
tcccttcggg aagcgtggcg ctttctcata 660 
aggtcgttcg ctccaagctg ggctgtgtgc 720 
ccttatccgg taactatcgt cttgagtcca 780 
cagcagccac tggtaacagg attagcagag 84 0 
tgaagtggtg gcctaactac ggctacacta 900 
tgaagccagt taccttcgga aaaagagttg 960 
ctggtagcgg tggttttttt gtttgcaagc 1020 
aagaagatcc tttgatcttt tctacggggt 1080 
aagggatttt ggtcatgaga ttatcgtcga 114 0 
atttacgttg acaccatcga atggtgcaaa 1200 
gaagagagtc aattcagggt ggtgaatgtg 1260 
tatgccggtg tctcttatca gaccgtttcc 1320 
gcgaaaacgc gggaaaaagt ggaagcggcg 1380 
gtggcacaac aactggcggg caaacagtcg 14 4 0 
gccctgcacg cgccgtcgca aattgtcgcg 1500 
gccagcgtgg tggtgtcgat ggtagaacga 1560 
cacaatcttc tcgcgcaacg cgtcagtggg 1620 
gatgccattg ctgtggaagc tgcctgcact 1680 
gaccagacac ccatcaacag tattattttc 1740 
gagcatctgg tcgcattggg tcaccagcaa 1800 
gtctcggcgc gtctgcgtct ggctggctgg 1860 
ccgatagcgg aacgggaagg cgactggagt 1920 
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gccatgtccg gttttcaaca aaccatgcaa atgctgaatg agggcatcgt tcccactgcg 1980 

atgctggttg ccaacgatca gatggcgctg ggcgcaatgc gcgccattac cgagtccggg 2040 

ctgcgcgttg gtgcggatat ctcggtagtg ggatacgacg ataccgaaga cagctcatgt 2100 

tatatcccgc cgttaaccac catcaaacag gattttcgcc tgctggggca aaccagcgtg 2160 

gaccgcttgc tgcaactctc tcagggccag gcggtgaagg gcaatcagct gttgcccgtc 2220 

tcactggtga aaagaaaaac caccctggcg cccaatacgc aaaccgcctc tccccgcgcg 2280 

ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag cgggcagtga 2340 

gcgcaacgca attaatgtaa gttagcgcga attgtcgacc aaagcggcca tcgtgcctcc 2400 

ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc 24 60 

gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa 2520 

ccaactcgcg aggggatcga gcccggggtg ggcgaagaac tccagcatga gatccccgcg 2580 

ctggaggatc atccagccgg cgtcccggaa aacgattccg aagcccaacc ttt cat agaa 2 64 0 

ggcggcggtg gaatcgaaat ctcgtgatgg caggttgggc gtcgcttggt cggtcatttc 2700 

gaaccccaga gtcccgctca gaagaactcg tcaagaaggc gatagaaggc gatgcgctgc 27 60 

gaatcgggag cggcgatacc gtaaagcacg aggaagcggt cagcccattc gccgccaagc 2820 

tcttcagcaa tatcacgggt agccaacgct atgtcctgat agcggtccgc cacacccagc 2880 

cggccacagt cgatgaatcc agaaaagcgg ccattttcca ccatgatatt cggcaagcag 2940 

gcatcgccat gggtcacgac gagatcctcg ccgtcgggca tgcgcgcctt gagcctggcg 3000 

aacagttcgg ctggcgcgag cccctgatgc tcttcgtcca gatcatcctg atcgacaaga 3060 

ccggcttcca tccgagtacg tgctcgctcg atgcgatgtt tcgcttggtg gtcgaatggg 3120 

caggtagccg gatcaagcgt atgcagccgc cgcattgcat cagccatgat ggatactttc 31*80 

tcggcaggag caaggtgaga tgacaggaga tcctgccccg gcacttcgcc caatagcagc 3240 

cagtcccttc ccgcttcagt gacaacgtcg agcacagctg cgcaaggaac gcccgtcgtg 3300 

gccagccacg atagccgcgc tgcctcgtcc tgcagttcat tcagggcacc ggacaggtcg 3360 

gtcttgacaa aaagaaccgg gcgcccctgc gctgacagcc ggaacacggc ggcatcagag 3420 

cagccgattg tctgttgtgc ccagtcatag ccgaatagcc tctccaccca agcggccgga 3480 
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gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc tgtctcttga 3540 
tcagatcttg atcccctgcg ccatcagatc cttggcggca agaaagccat ccagtttact 3600 
ttgcagggct tcccaacctt accagagggc gccccagctg gcaattccgg ttcgcttgct 3660 
gtccataaaa ccgcccagtc tagctatcgc catgtaagcc cactgcaagc tacctgcttt 3720 
ctctttgcgc ttgcgttttc ccttgtccag atagcccagt agctgacatt catccggggt 3780 
cagcaccgtt tctgcggact ggctttctac gtgttccgct tcctttagca gcccttgcgc 384 0 
cctgagtgct tgcggcagcg tgaagcttaa aaaactgcaa aaaatagttt gacttgtgag 3900 
cggataacaa ttaagatgta cccaattgtg agcggataac aatttcacac attaaagagg 3960 
agaaattaca tatg 3974 



<210> 13 

<211> 112 

<212> ONA 

<213> Homo sapiens 

<400> 13 

aagcttaaaa aactgcaaaa aatagtttga cttgtgagcg gataacaatt aagatgtacc 60 
caattgtgag cggataacaa tttcacacat taaagaggag aaattacata tg 112 



<210> 14 

<211> 542 

<212> DNA 

<213> Mus musculus 

<220> 

<221> UNSURE 
<222> (3) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (21) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (22) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (361) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (369) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (407) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (427) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (479) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (482) 

<223> May be. any„ nucleic _ acid 
<220> 

<221> UNSURE 
<222> (535) 

<223> May be any nucleic acid 
<400> 14 

gtncgaattt cggcacgaga nnttagacgc cttttcatgg aagctgggga atgtgggggc 60 
cttggggaga ctgttcgaga acgtgcggtg gaggagtcca gtacacgatg agggaatgtg 120 
acaacccagt cccaaagaat ggagggaagt actgtgaagg caaacgagtg cgctacagat 180 
cctgtaacct tgaggactgt ccagacaata atggaaaaac ctttagagag gaacaatgtg 24 0 
aagcacacaa cgagttttca aaagcttcct ttgggagtgg gcctgcggtg gaatggattc 300 
ccaagtacgc tggcgtctca ccaaaggaca ggtgcaagtt catgttgcca agccaaaggc 360 
nttggctant tctttcgttt tgcagcccaa ggttgttagg tgggtantcc atgttaggcc 420 
cagattncac ctttgtctgt gtgcaaggac agtgtgttaa aagttggttg tgatccgcnt 480 
cntagattcc aaaaggagtt ttgttaatgt ggtgttttcn gggggaatgg tctantttta 54 0 

542 



<210> 15 
<211> 320 
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<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism : Unknown 
<400> 15 

cagagaacat tcgccccact cttcaatgac ccatgctgaa aaagtgggga tagcattgaa 60 
agattccttc ttcttcttta cgaagtaggt gtatttaatt ttaggtcgaa gggcattgcc 120 
cacagtaaga acctggatgg tcaagggctc tttgagaggg ctaaagctgc gaattctttc 180 
caatgccgca gaggagccgc tgtacctcaa gacaacacct ttgtacataa tgtcttgctc 240 
taaggtggac aaagtgtagt caccattaag aatatatgtg ccatcagcag ctttgatggc 300 
aagaaagctg cccttgttcc 320 

<210> 16 
<211> 316 
<212> DNA 

<213> Eimeria tenella 
<400> 16 

aatgccgaga cattaatgga cagcctgctt ccgagtgtgc aaaggaagtg aagccagcca 60 
gcaccagacc ttgtgcagac catccctgcc cccagtggca gctgggggag tggtcatcat 120 
gttctaagac ctgtgggaag ggttacaaaa aaagaagctt gaagtgtctg tcccatgatg 180 
gaggggtgtt atctcatgag agctgtgatc ctttaaagaa acctaaacat ttcatagact 240 
tttgcacaat ggcagaatgc agttaagtgg tttaagtggt gttagctttg agggcaaggc 300 
aaagtgagga agggct 316 



<210> 17 
<211> 383 
<212> DNA 

<213> Caenorhabditis elegans 
<220> 

<221> UNSURE 
<222> (160) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (326) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (358) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (366) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (377) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (379) 

<223> May be any nucleic acid 
<400> 17 

gtcgacccac gcgtccggat ggtactccat gtagcccaga ttccacctct gtctgtgtgc 60 
aaggacagtg tgtaaaagct ggttgtgatc gcatcataga ctccaaaaag aagtttgata 120 
aatgtggtgt ttgcggggga aatggatcta cttgtaaaan aatatcagga tcagttacta 180 
gtgcaaaacc tgggatatca tgatatcatc acaattccaa ctgggagcca ccaacatcga 240 
agtgaaacag cggaaccaga ggggatccag ggaacaatgg gcagctttct tgccatcaaa 300 
gctgctggat ggcacatata ttcttnaatg gtgactacac tttgtccacc ttagaganag 360 
acattntgtg acaaagngnt tgt 383 



<210> 18 
<211> 404 
<212> DNA 

<213> Crotalus atrox 
<220> 

<221> UNSURE 
<222> (21) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (301) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (335) 
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<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (373) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (378) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (382) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (383) 

<223> May be any nucleic acid 
<400> 18 

cccacgcgtc cgcccacggt nccgggactt gtgtgggtcc cagacatgtg atactcttgg 60 
gatggctgat gttggaactg tgtgtgatcc gagcagaagc tgctccgtca tagaagatga 120 
tggtttacaa gctgccttca ccacagccca tgaattaggc cacgtgttta acatgccaca 180 
tgatggatgc aaagcagtgt gccagcctta aatggtgtga accagggatt cccacatgat 24 0 
ggcgtcaatg ctttccaacc tgggaccaca gccagccttg ggtcctcctt gcagtggcct 300 
nacatggatt gacatcattt ctgggatgaa tggtncatgg gggaatgttt tgattggaca 360 
agccttcaga atnccctnac annttcccag gggttctccc tggg 404 



<210> 19 
<211> 152 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (105) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (122) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (135) 
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<223> May be any nucleic acid 
<400> 19 

atcgtagaag atgaaaaatg gggcccagag gtgtccgaca atggggggct tacactgcgt 60 
aacttctgca actggcagcg gcgtttcaac cagcccagcg accgncaccc agagcactac 120 
gncacggcca tcctnctcac cagacagaac tt 152 

<210> 20 
<211> 4180 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 20 

gcagctccga gctaggtgct atcgcaaggc cagagcgcac agcccggcgg agagagcaga 60 
tccttgctca gatcgagtca aatcgggcca aggcggagga cgaagagtcc aggctcctat 120 . . 
tctggacttg ttccccagct ccgggggcgc ttctaggtcc tgcagcagcc agcagtgcgg 180 
agccaccaac tcggtgctgg aatgaaaaaa ttcccgcgcg ccagtgcaga atctttctaa 24 0 
gtgacccgga gcttcgggtg ctagctctgc acgaactttc ccatcaaagt gatcgtgaat 300 
tttaagcatc aggagcaggc cagcgaagct ctacgcgtct aaacgtctat ccagaccaag 360 
agttctctgc ggtgcagggt gcggtgccat gcagccaaaa gtcccttttg gggtcacgca 420 
agcagaagcc ctgctccgac atgggggacg tccagcgggc agcgagatct cggggctctc 480 
tgtccgcaca catgctgttg ctgctcctcg cttccataac aatgctgcta tgtgcgcggg 540 
gcgcacacgg gcgccccacg gaggaagatg aggagctggt cctgccctcg ctggagcgcg 600 
ccccgggcca cgattccacc accacacgcc ttcgtctgga cgcctttggc cagcagctac 660 
atctgaagtt gcagccggac agcggtttct tggcgcctgg cttcaccctg cagactgtgg 720 
ggcgcagtcc cgggtccgag gcacaacatc tggaccccac cggggacctg gctcactgct 780 
tctactctgg cacggtgaac ggtgatcccg gctctgccgc agccctcagc ctctgtgaag 840 
gtgtgcgtgg tgccttctac ctacaaggag aggagttctt cattcagcca gcgcctggag 900 
tggccaccga gcgcctggcc cctgccgtgc ccgaggagga gtcatccgca cggccgcagt 960 
tccacatcct gaggcgaagg cggcggggca gtggcggcgc caagtgcggc gtcatggacg 1020 
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acgagaccct gccaaccagc gactcgcgac ccgagagcca gaacacccgg aaccagtggc 1080 
ctgtgcggga ccccacgcct caggacgcgg gaaagccatc aggaccagga agcataagga 1140 
agaagcgatt tgtgtccagc ccccgttatg tggaaaccat gctcgtagct gaccagtcca 1200 
tgg.ccgactt ccacggcagc ggtctaaagc attaccttct aaccctgttc tcggtggcag 12 60 
ccaggtttta caagcatccc agcattagga attcaattag cctggtggtg gtgaagatct 1320 
tggtcatata cgaggagcag aagggaccag aagttacctc caatgcagct ctcacccttc 1380 
ggaatttctg cagctggcag aaacaacaca acagccccag tgaccgggat ccagagcact 14 40 
atgacactgc aattctgttc accagacagg atttatgtgg ctcccacacg tgtgacactc 1500 
tcggaatggc agatgttgga accgtatgtg accccagcag gagctgctca gtcatagaag 15 60 
atgatggttt gcaagccgcc ttcaccacag cccatgaatt gggccatgtg tttaacatgc 1620 
cgcacgatga tgctaagcac tgtgccagct tgaatggtgt gagtggcgat tctcatctga 1680 
tggcctcgat gctctccagc ttagaccata gccagccctg gtcaccttgc agtgcctaca 17 40 
tggtcacgtc cttcctagat aatggacacg gggaatgttt gatggacaag ccccagaatc 1800 
caatcaagct cccttctgat cttcccggta ccttgtacga tgccaaccgc cagtgtcagt 1860 
ttacattcgg agaggaatcc aagcactgcc ctgatgcagc cagcacatgt actaccctgt 1920 
ggtgcactgg cacctccggt ggcttactgg tgtgccaaac aaaacacttc ccttgggcag 1980 
atggcaccag ctgtggagaa gggaagtggt gtgtcagtgg caagtgcgtg aacaagacag 204 0 
acatgaagca ttttgctact cctgttcatg gaagctgggg accatgggga ccgtggggag 2100 
actgctcaag aacctgtggt ggtggagttc aatacacaat gagagaatgt gacaacccag 2160 
tcccaaagaa cggagggaag tactgtgaag gcaaacgagt ccgctacagg tcctgtaaca 2220 
tcgaggactg tccagacaat aacggaaaaa cgttcagaga ggagcagtgc gaggcgcaca 2280 
atgagttttc caaagcttcc tttgggaatg agcccactgt agagtggaca cccaagtacg 2340 
ccggcgtctc gccaaaggac aggtgcaagc tcacctgtga agccaaaggc attggctact 24 00 
ttttcgtctt acagcccaag gttgtagatg gcactccctg tagtccagac tctacctctg 24 60 
tctgtgtgca agggcagtgt gtgaaagctg gctgtgatcg catcatagac tccaaaaaga 2520 
agtttgataa gtgtggcgtt tgtggaggaa acggttccac atgcaagaag atgtcaggaa 2580 
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tagtcactag tacaagacct gggtatcatg acattgtcac aattcctgct ggagccacca 2640 
acattgaagt gaaacatcgg aatcaaaggg ggtccagaaa caatggcagc tttctggcta 2700 
ttagagccgc tgatggtacc tatattctga atggaaactt cactctgtcc acactagagc 27 60 
aagacctcac ctacaaaggt actgtcttaa ggtacagtgg ttcctcggct gcgctggaaa 2820 
gaatccgcag ctttagtcca ctcaaagaac ccttaaccat ccaggttctt atggtaggcc 2880 
atgctctccg acccaaaatt aaattcacct actttatgaa gaagaagaca gagtcattca 2940 
acgccattcc cacattttct gagtgggtga ttgaagagtg gggggagtgc tccaagacat 3000 
gcggctcagg ttggcagaga agagtagtgc agtgcagaga cattaacgga caccctgctt 3060 
ccgaatgtgc aaaggaagtg aagccagcca gtaccagacc ttgtgcagac cttccttgcc 3120 
cacactggca ggtgggggat tggtcaccat gttccaaaac ttgcgggaag ggttacaaga 3180 
agagaacctt gaaatgtgtg tcccacgatg ggggcgtgtt atcaaatgag agctgtgatc 324 0 
ctttgaagaa gccaaagcat tacattgact tttgcacact gacacagtgc agttaagagg 3300 
cgttagagga caaggtagcg tggggagggg ctgatacact gagtgcaaga gtactggagg 3360 
gatccagtga gtcaaaccag taagcagtga ggtgtggcaa ggaggtgtgt gtaggggata 3420 
catagcaaag gaggtagatc aggacactac cctgccagtt acattctgat aaggtagtta 3480 
atgaggcaca gtagcatctg aaagaccata cagagcacta aggagcccca aagcactatt 354 0 
agtatctctt ttcttatatc tatcgcccaa ataattttca gagtctggca gaagccctgt 3600 
tgcactgtac taactagata cttcttatca caaagattgg gaaaggcaaa gcagaaagat 3660 
ggtaagactg ggtttcaaac aaggcttggt ttcaatcact ggaggcaagg aggaggggac 3720 
aaacaagatc attattcgaa gtcgctggtt gctgtggttt tacggaaggt tgatgcatca 3780 
ttcctatcaa cagtgaaaag ttcagcttgt tcaacgtgac agaaaggctc atctccgtga 3840 
aagagctcct gatttcttct tacaccatct cagttcttaa ctatagttca tgttgaggta 3900 
gaaacaattc atctatttat aaaatgtaca ttggaaaaaa aaagtgaagt ttatgaggta 3960 
cacataaaaa ctgaaggaaa caatgagcaa catgcctcct gctttgcttc ctcctgaggt 4020 
aaacctgcct ggggattgag gttgtttaag attatccatg gctcacaaga ggcagtaaaa 4080 
taatacatgt tgtgccagag ttagaatggg gtatagagat cagggtccca tgagatgggg 4140 
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aacatggtga tcactcatct cacatgggag gctgctgcag 4180 

<210> 21 

<211> 9248 

<212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 21 

gcagctccga gctaggtgct atcgcaaggc cagagcgcac agcccggcgg agagagcaga 60 
tccttgctca gatcgagtca aatcggggcc aaggcggagg acgaagagtc caggctccta 120 
ttctggactt gttccccagc tccgggggcg cttctaggtc ctgcagcagc caggagtgcg 1B0 
gagccaccaa ctcggtgctg gaatgaaaaa attcccgcgc gccagtgcag aatctttcta 24 0 
agtgacccgg agcttcgggt gctagctctg cacgaacttt cccatcaaag tgatcgtgaa 300 
ttttaagcat caggagcagg ccagcgaagc tctacgcgtc taaacgtcta tccagaccaa 360 
gagttctctg cggtgcaggg tgcggtgcca tgcagccaaa agtccctttg gggtcacgca 420 
agcagaagcc ctgctccgac atgggggacg tccagcgggc agcgagatct cggggctctc 4 80 
tgtccgcaca catgctgttg ctgctcctcg cttccataac aatgctgcta tgtgcgcggg 540 
gcgcacacgg gcgccccacg gaggaagatg aggagctggt cctgccctcg ctggagcgcg 600 
ccccgggcca cgattccacc accacacgcc ttcgtctgga cgcctttggc cagcagctac 660 
atctgaagtt gcagccggac agcggtttct tggcgcctgg cttcaccctg cagactgtgg 720 
ggcgcagtcc cgggtccgag gcacaacatc tggaccccac cggggacctg gctcactgct 780 
tctactctgg cacggtgaac ggtgatcccg gctctgccgc agccctcagc ctctgtgaag 84 0 
gtgtgcgtgg tgccttctac ctacaaggag aggagttctt cattcagcca gcgcctggag 900 
tggccaccga gcgcctggcc cctgccgtgc ccgaggagga gtcatccgca cggccgcagt 960 
tccacatcct gaggcgaagg cggcggggca gtggcggcgc caagtgcggc gtcatggacg 1020 
acgagaccct gccaaccagc gactcgcgac ccgagagcca. gaacacccgg aaccagtggc 1080 
ctgtgcggga ccccacgcct caggacgcgg gaaagccatc aggtataaga gtgaccccca 114 0 
tctctcagtc tttacgaggc gtgacttggg gtcacactcc agatcgcctc taaatgcgaa 1200 
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tgactcagac ttgcagtgaa ttgaagttct gggtcgtgac cttcccgctc cccccccccc 1260 

aaaaaaagtg tgaccatact ctgctagaac acttatttgc ccgaatagtt aataatttga 1320 

gaaagagaga aagaatcgga ggtcctgtag ataagggcta agcgtttcct ccgcgaagcc 1380 

aataacccga ctccttacac tggagaatct ctctccatcc ctttaatgcc tttagtgaat 1440 

gtatgagttc actttaacta ggttgtagtt tcgcgctgag ttttgtaacg tcagtccgtg 1500 

tgagcacgta gcgctcaaag gagggcggag tagaggagcc atggtgacct ggatgtgcgt 1560 

tcaggagcct gggcaacggc agtggtgatc tcatttctgt ggccttccgt ctgtcccctt 1620 

cccccatttg aaaagctgac cccgatggct ggtggctccg ttgggcccct ctgcagaacc 1680 

tgcttgggag gtctttgctt ggttcgcccc gcctccacgc gcctcctacc tcggcctcgt 174 0 

tgctcgcact ccctctcccg gcagaggttg gactccccag cgctgtggaa tgttagcctg 1800 

gactgatcct ccctgctaca cattcgcctg actctgccgt gttcagtctc taccagccag 1860 

ttagttcttt ttaatcattc aaatttcttt ttgccctttt ctagatttct ccctcttttc 1920 

cgacttgtcc ctaggagctg gtattcatat cctactttac gatttctctg accgctgagt 1980 

ctcagcagcc cgaaaaaggc cattttccaa attggcaacc ctggtttgag aaaggaactt 2040 

attccccccg gggcactggg agtgagagga ggcaggaaaa cactgctggg cagagtgggt 2100 

ggtcctagtg cccggaactg gatcaagcag agaaccccct gggacccctt gaatgagaga 2160 

gctgagcctt acagactgag actcctcaag ccccacccct tggctgagct ccccgccctg 2220 

ccccatgcct tccacgtgga gctggatgat ctcattcggg atttcagccc tggcttcaat 2280 

agtgaaaggg tgactcaggg cgtccgcctg cttctcttgc caagttttta ctacagctgg 23 4 0 

gtagaaatga tagccatact gcctcactca ggctgtggag tcttcaaaga ccacaaaaga 2400 

aatctgcgga cacatatata gacagtttga tcactctgtt gcttgctttg ttttgttttg 24 60 

ttttgtctta tttaaagcaa aagaaaaaag acttaaaaat aactcacagt ttttagaaga 2520 

tgcaaatatt tgttttattt ttgttccagg tgtatttcag ttttatttac tttgactagg 2580 

ttgactttcc taatataccc cgagaaggtc actattagga gaaggactgc ccatgagcaa 2640 

acttcctttt ctttttacag gaccaggaag cataaggaag aagcgatttg tgtccagccc 2700 

ccgttatgtg gaaaccatgc tcgtggctga ccagtccatg gccgacttcc acggcagcgg 27 60 
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tctaaagcat taccttctaa ccctgttctc ggtggcagcc aggttttaca agcatcccag 2820 
cattaggaat tcaattagcc tggtggtggt gaagatcttg gtcatatatg aggagcagaa 2880 
gggaccagaa gttacctcca atgcagctct cacccttcgg aatttctgca actggcagaa 294 0 
acaacacaac agccccagtg accgggatcc agagcactat gacactgcaa ttctgttcac 3000 
cagacaggta agacaggagc ttatcaacca tttcatcaac tcaactcgga ggtcagcctt 3060 
gtgttggatg ggatgagagg gtgggggtgt ggcggagagg aaacccagaa ggggatgaca 3120 
tttgaaatgt aaacaaaata accaattaaa aaaaaaaggc atctcatctg tattgcctca 3180 
tttcctttcg gttataggct agctcaatct gtcttgctta tttctatttt aaacttccac 3240 
atctcaagtt ctacagttct attttaaaag cattacaggg aatcttgctt agagtcagtc 3300 
cttcaagccc agcaataatg aatggacagg cttcaaagtg catgtgaaga cacgcccaac 3360 
tgaagagcta agtatcactc tctcctactt aaaagggatt tcccttgcct ctttgtagga 34 20 
tttatgtggc tcccacacgt gtgacactct cgggatggca gatgttggaa ctgtatgtga 34 80 
ccccagcagg agctgctcag tcatagaaga tgatggtttg caagccgcct tcaccacagc 354 0 
ccacgaattg ggtaagtcgg cttcagagta caagttaagc ccaaatgcat ggatacaacc 3600 
caataagtca atctgatgtg acgagagaga aaacatctca gactatgttg ctacctcagc 3660 
caccagcaat tttagaaggg gtagggtata ttttccacga tttcaagtat ggtcttacta 3720 
ggacaggaga aagtggtaca aacatttgaa cgttgacatt tttatacttg ccctgatcaa 3780 
agtgagtatg agccccaata caggttgtct aataagagag ccattgagcc tcactcaata 3840 
atacagctga atgtccttct tgtctgcttc ccaggccatg tgtttaacat gccgcacgat 3900 
gatgctaagc actgtgccag cttgaatggt gtgactggcg attctcatct gatggcctcg 3960 
atgctctcca gcttagacca tagccagccc tggtcacctt gcagtgccta catggtcacg 4 020 
tccttcctag ataatggaca cggtaagatg acagctcctc tttccagatg gtgttcaacc 4080 
ttccttgtgt agggctctct ctggctaagt gagctccatg gctcttgctc atttcccctc 4140 
cttcagagtt ttctctggca ggatcataag tagtagatct ttacctccat tgcatcctgc 4200 
tcccaaagtc cattcattca taaacaataa cttctcgcca ttgtaaaatc agaagtcccc 4260 
tattgaggat aacgtctcga taaaaatcta aagttcccta gcattgattt tcccaaaaat 4320 
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gcatgatttc accaaacatg tattaataat tgcctctttt ttcttttcct tttttttttt 4380 
tattatttta ggggaatgtt tgatggacaa gccccagaat ccaatcaagc tcccttctga 4440 
tcttcccggt accttgtacg atgccaaccg ccagtgtcag tttacattcg gagaggaatc 4500 
caagcactgc cctgatgcag ccagcacatg tactaccctg tggtgcactg gcacctccgg 4 560 
tggcttactg gtgtgccaaa caaaacactt cccttgggca gatggcacca gctgtggaga 4 620 
agggaagtgg tgtgtcagtg gcaagtgcgt gaacaagaca gacatgaagc attttgctgt 4 680 
gagttttccc aatgaaacat atccgtttgc aactcagggt tgagaagggc aaagtgatgg 4740 
tttagttcct ttcctagaca aactcctcta cctgtgtcct gtagtgggac tatgagatgg 4 800 
tagcgtattt tgagaattga ttgtctgttt tacatttttc tctgattccc taaaatgtct 4860 
ttatagttct aacactgata tctgtatctc catttagact cctgttcatg gaagctgggg 4 920 
accatgggga ccgtggggag actgctcaag aacctgtggt ggtggagttc aatacacaat 4 980 
gagagaatgt gacaacccag tcccaaagaa cggagggaag tactgtgaag gcaaacgagt 5040 
ccgctacagg tcctgtaaca tcgaggactg tccagacaat aacggtgagt catactggac 5100 
ttcagctctc agaaaccggg caaaggcggc gtgccacaac atgtggttgg aagttggaaa 5160 
ctgggaacat catcgccgtc gttctctttt caggaaaaac gttcagagag gagcagtgcg 5220 
aggcgcacaa tgagttttcc aaagcttcct ttgggaatga gcccactgta gagtggacac 5280 
ccaagtacgc cggcgtctcg ccaaaggaca ggtgcaagct cacctgtgaa gccaaaggca 534 0 
ttggctactt tttcgtctta cagcccaagg taggtgcttt tacacttgaa tctttgcaaa 54 00 
ggagcctcag ctgggcttgc tgccatgcca tacaaatgtt tgggctgtct ttacctattg 54 60 
atctgtgttc cgttttgaat ttggaatact tctaaatgca ggaacaactc cttgctttgg 5520 
gatttgttgt tgccttctgt tgggaaggaa gcttaaatct agctagcact taaaagagtc 5580 
ttgcatgtgt ttaatattgc ttctctatcc ccaaagaatg gccctttgaa aactcaagag 564 0 
ccctctctgt ataactaggt ttcacataca aaaattcatg gttagataaa ttatatatta 5700 
acatggcacc caggagtttt agaaagtagt ccaaagtact tgttactggg tacctagcag 57 60 
ccgcacatac ga'gcacacta actaaggtaa gagtttgaga attaaaaatt catcgttgga 5820 
acatgtactt tgaccaaaga gactcgccat ttcttttggt gttttgcaga aaggataaat 58 80 
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cctgctttga agaagaaaat tgaatgaaat 
tataagatag caaactatat ccaagttgtg 
taccttcaac tacagcatat tattctaggt 
gtgaaccctg cttccattga cctaggccag 
cttacagata gctatatgaa ctgttgaaaa 
gacaaggaga ctacttgtag cctgggagga 
tttaaccacc atggtgcatg ttgtctccac 
actctacctc tgtctgtgtg caagggcagt 
actccaaaaa gaagtttgat aagtgtggcg 
agatgtcagg aatagtcact agtacaaggt 
cacgctgtgt tgctcagttg gtccctagca 
atgtacctgt agtcccagtg tatgggagac 
tcaggtcatc tttgctacat agtgactttc 
ctggaaacac aaacacacac aaaacaatta 
agtaggctga cttgggagga tggttgctat 
agttccagta taacctggcc cacaagtgaa 
aaaacacatg ctttggagac cctgtaattt 
gtcacattta aaagaattgc ctatttttaa 
catatacctg ttttctttgt ccacagacct 
ggagccacca acattgaagt gaaacatcgg 
tttctggcta ttagagccgc tgatggtacc 
acactagagc aagacctcac ctacaaaggt 
gcgctggaga gaatccgcag ctttagtcca 
atggtaggcc atgctctccg acccaaaatt 
gagtcattca acgccattcc cacattttct 
tccaagacat gcggctcagg "ttggcagaga 
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ttgcttaagc ttgtcatgta ttcttagcat 5940 
gatgaagtat ttagcaagtg atttataaag 6000 
actgaccatg gaacaataat cagtgtgaca 6060 
caaatatata aaatcaagac atttataagc 6120 
agccaaaatg aaagtgaaca tgtggcacgt 6180 
gagcattccc agttgccatc acatcagatg 624 0 
aggttgtaga tggcactccc tgtagtccag 6300 
gtgtgaaagc tggctgtgat cgcatcatag 6360 
tttgtggagg aaacggttcc acatgcaaga 6420 
gagtttcaga acgctcactt ctgcagtaga 6480 
tctacaagac cttgggttca atccgcatgc 654 0 
agagacaagt gtgacaagac ggtcagatgt 6600 
agttcacctt ggggaacatg aaaaacctga 6660 
acccaggtac ttcatgtaat cccagtgttc 6720 
aaggcctagg ttagcttggt ctacataatg 6780 
ccctaaagtt aattaatcga cacatgaaac 68 40 
tgatatacga ttttgtagga ctaaggaaaa 6900 
agcaatgtga ttgattaact cattgaaaga 6960 
gggtatcatg acattgtcac aattcctgct 7020 
aatcaaaggg ggtccagaaa caatggcagc 7080 
tatattctga atggaaactt cactctgtcc 7140 
actgtcttaa ggtacagtgg ttcctcggct 7200 
ctcaaagaac ccttaaccat ccaggttctt 7260 
aaattcacct actttatgaa gaagaagaca 7320 
gagtgggtga ttgaagagtg gggggagtgc 7380 
agagtagtgc agtgcagaga cattaatgga 7440 
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caccctgctt ccgaatgtgc aaaggaagtg aagccagcca gtaccagacc ttgtgcagac 7500 

cttccttgcc cacactggca ggtgggggat tggtcaccat gttccaaaac ttgcgggaag 7560 

ggttacaaga agagaacctt gaaatgtgtg tcccacgatg ggggcgtgtt atcaaatgag 7620 

agctgtgatc ctttgaagaa gccaaagcat tacattgact tttgcacact gacacagtgc 7680 

agttaagagg cgttagagga caaggtagcg tggggagggg ctgatacact gagtgctgga 7740 

gggatccagt gagtcaaacc agtaagcagt gaggtgtggc aaggaggtgt gtgtagggga 7800 

tacatagcaa aggaggtaga tcaggacact accctgccag ttacattctg ataaggtagt 7860 

taatgaggca cagtagcatc tgaaagacca tacagagcac taaggagccc caaagcacta 7920 

ttagtatctc ttttcttata tctatcgccc aaataatttt cagagtctgg cagaagccct 7980 

gttgcactgt actgactaga tacttcttat cacaaagatt gggaaaggca aagcagaaag 8040 

atggtaagac tgggtttcaa acaaggcttg gtttctatca ctggaggcaa ggaggagggg 8100 

acaaacaaga tcattattcg aagtcgctgg ttgctgtggt tttacggaag gttgatgcat 8160 

cattcctatc aacagtgaaa agttcagctt gttcaacgtg acagaaaggc tcatctccgt 8220 

gaaagagctc ctgatttctt cttacaccat ctcagttctt aactataatt catgttgagg 8280 

tagaaacaat tcatctattt ataaaatgta cattggaaaa aaaaaagtga agtttatgag 8340 

gtacacataa aaactgaagg aaacaatgag caacatgcct cctgctttgc ttcctcctga 8400 

ggtaaacctg cctggggatt gaggttgttt aagattatcc atggctcaca agaggcagta 84 60 

aaataataca tgttgtgcca gagttagaat ggggtataga gatcagggtc ccatgagatg 8520 

gggaacatgg tgatcactca tctcacatgg gaggctgctg cagggtagca ggtccactcc 8580 

tggcagctgg tccaacagtc gtatcctggt gaatgtctgt tcagctcttc tactgagaga 864 0 

gaatatgact gtttccatat gtatatgtat atagtaaaat atgttactat gaattgcatg 8700 

tactttataa gtattggtgt gtctgttcct tctaagaagg actatagttt ataataaatg 8760 

cctataataa catatttatt tttatacatt tatttctaat gataaaacct ttaagttata 8820 

tcgcttttgt aaaagtgcat ataaaaatag agtatttata caatatatgt taactagaaa 8880 

taataaaaga acacttttga atgtgtatgc ctattttctg gagtgggatt aacttctggg 8940 

caagaaatct gatgagacac aaacattgga cttcaagaca gttttaaaat ttgggtaaat 9000 
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gaactgtatt tcctgtttat agacgtacta ataaaaaaga agttgatgat gtctttagtg 9060 

gtaagattgt tactaatgtg gttggcaaat tgctgtaaag agccagatag taagcattta 9120 

tggcattgta ggctatcttt cctgccacaa ccatgtgaca gtgagtgctt tgtaggactg 9180 

agagcagcca taaatgacat gtaaatgata aactgtggct gtgctttaat aaaactttat 924 0 
ttacaaaa 9248 



<210> 22 
<211> 5722 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism : Unknown 
<400> 22 
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gcctggggac 


taggcgtcct 


gttcctgatg 


catgtgtgtg 


gcaccaaccg 


cattccagag 


180 


tctggcggag 


acaacagcgt 


gtttgacatc 


tttgaactca 


ccggggccgc 


ccgcaagggg 


240 


tctgggcgcc 


gactggtgaa 


gggccccgac 


ccttccagcc 


cagctttccg 


catcgaggat 


300 


gccaacctga 


tcccccctgt 


gcctgatgac 


aagttccaag 


acctggtgga 


tgctgtgcgg 


360 


gcagaaaagg 


gtttcctcct 


tctggcatcc 


ctgaggcaga 


tgaagaagac 


ccggggcacg 


420 


ctgctggccc 


tggagcggaa 


agaccactct 


ggccaggtct 


tcagcgtggt 


gtccaatggc 


480 


aaggcgggca 


ccctggacct 


cagcctgacc 


gtccaaggaa 


agcagcacgt 


ggtgtctgtg 


540 


gaagaagctc 


tcctggcaac 


cggccagtgg 


aagagcatca 


ccctgtttgt 


gcaggaagac 


600 


agggcccagc 


tgtacatcga 


ctgtgaaaag 


atggagaatg 


ctgagttgga 


cgtccccatc 


660 


caaagcgtct 


tcaccagaga 


cctggccagc 


atcgccagac 


tccgcatcgc 


aaaggggggc 


720 


gtcaatgaca 


atttccaggg 


ggtgctgcag 


aatgtgaggt 


ttgtctttgg 


aaccacacca 


780 


gaagacatcc 


tcaggaacaa 


aggctgctcc 


agctctacca 


gtgtcctcct 


cacccttgac 


840 


aacaacgtgg 


tgaatggttc 


cagccctgcc 


atccgcacta 


actacattgg 


ccacaagaca 


900 


aaggacttgc 


aagccatctg 


cggcatctcc 


tgtgatgagc 


tgtccagcat 


ggtcctggaa 


960 
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ctcaggggcc tgcgcaccat tgtgaccacg ctgcaggaca gcatccgcaa agtgactgaa 1020 
gagaacaaag agttggccaa tgagctgagg cggcctcccc tatgctatca caacggagtt 1080 
cagtacagaa ataacgagga atggactgtt gatagctgca ctgagtgtca ctgtcagaac 1140 
tcagttacca tctgcaaaaa ggtgtcctgc cccatcatgc cctgctccaa tgccacagtt 1200 
cctgatggag aatgctgtcc tcgctgttgg cccagcgact ctgcggacga tggctggtct 1260 
ccatggtccg agtggacctc ctgttctacg agctgtggca atggaattca gcagcgcggc 1320 
cgctcctgcg atagcctcaa caaccgatgt gagggctcct cggtccagac acggacctgc 1380 
cacattcagg agtgtgacaa aagatttaaa caggatggtg gctggagcca ctggtccccg 1440 
tggtcatctt gttctgtgac atgtggtgat ggtgtgatca caaggatccg gctctgcaac 1500 
tctcccagcc cccagatgaa tgggaaaccc tgtgaaggcg aagcgcggga gaccaaagcc 1560 
tgcaagaaag acgcctgccc catcaatgga ggctggggtc cttggtcacc atgggacatc 1620 
tgttctgtca cctgtggagg aggggtacag aaacgtagtc gtctctgcaa caaccccgca 1680 
ccccagtttg gaggcaagga ctgcgttggt gatgtaacag aaaaccagat ctgcaacaag 1740 
caggactgtc caattgatgg atgcctgtcc aatccctgct ttgccggcgt gaagtgtact 1800 
agctaccctg atggcagctg gaaatgtggt gcttgtcccc ctggtztacag tggaaatggc 1860 
atccagtgca cagatgttga tgagtgcaaa gaagtgcctg atgcctgctt caaccacaat 1920 
ggagagcacc ggtgtgagaa cacggacccc ggctacaact gcctgccctg ccccccacgc 1980 
ttcaccggct cacagccctt cggccagggt gtcgaacatg ccacggccaa caaacaggtg 204 0 
tgcaagcccc gtaacccctg cacggatggg acccacgact gcaacaagaa cgccaagtgc 2100 
aactacctgg gccactatag cgaccccatg taccgctgcg agtgcaagcc tggctacgct 2160 
ggcaatggca tcatctgcgg ggaggacaca gacctggatg gctggcccaa tgagaacctg 2220 
gtgtgcgtgg ccaatgcgac ttaccactgc aaaaaggata attgccccaa ccttcccaac 2280 
tcagggcagg aagactatga caaggatgga attggtgatg cctgtgatga tgacgatgac 2340 
aatgataaaa ttccagatga cagggacaac tgtccattcc attacaaccc agctcagtat 2400 
gactatgaca gagatgatgt gggagaccgc tgtgacaact gtccctacaa ccacaaccca 24 60 
gatcaggcag acacagacaa caatggggaa ggagacgcct gtgctgcaga cattgatgga 2520 
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gacggtatcc tcaatgaacg ggacaactgc cagtacgtct acaatgtgga ccagagagac 2580 

actgatatgg atggggttgg agatcagtgt gacaattgcc ccttggaaca caatccggat 2640 

cagctggact ctgactcaga ccgcattgga gatacctgtg acaacaatca ggatattgat 2700 

gaagatggcc accagaacaa tctggacaac tgtccctatg tgcccaatgc caaccaggct 2760 

gaccatgaca aagatggcaa gggagatgcc tgtgaccacg atgatgacaa cgatggcatt 2820 

cctgatgaca aggacaactg cagactcgtg cccaatcccg accagaagga ctctgacggc 2880 

gatggtcgag gtgatgcctg caaagatgat tttgaccatg acagtgtgcc agacatcgat 2940 

gacatctgtc ctgagaatgt tgacatcagt gagaccgatt tccgccgatt ccagatgatt 3000 

cctctggacc ccaaagggac atcccaaaat gaccctaact gggttgtacg ccatcagggt 3060 

aaagaactcg tccagactgt caactgtgat cctggactcg ctgtaggtta tgatgagttt 3120 

aatgctgtgg acttcagtgg caccttcttc atcaacaccg aaagggacga tgactatgct 3180 

ggatttgtct ttggctacca gtccagcagc cgcttttatg ttgtgatgtg gaagcaagtc 324 0 

acccagtcct actgggacac caaccccacg agggctcagg gatactcggg cctttctgtg 3300 

aaagttgtaa actccaccac agggcctggc gagcacctgc ggaacgccct gtggcacaca 3360 

ggaaacaccc ctggccaggt gcgcaccctg tggcatgacc ctcgtcacat aggctggaaa 34 20 

gatttcaccg cctacagatg gcgtctcagc cacaggccaa agacgggttt cattagagtg 3480 

gtgatgtatg aagggaagaa aatcatggct gactcaggac ccatctatga taaaacctat 3540 

gctggtggta gactagggtt gtttgtcttc tctcaagaaa tggtgttctt ctctgacctg 3600 

aaatacgaat gtagagatcc ctaatcatca aattgttgat tgaaagactg atcataaacc 3660 

aatgctggta ttgcaccttc tggaactatg ggcttgagaa aacccccagg atcacttctc 3720 

cttggcttcc ttcttttctg tgcttgcatc' agtgtggact cctagaacgt gcgacctgcc 3780 

tcaagaaaat gcagttttca aaaacagact catcagcatt cagcctccaa tgaataagac 3840 

atcttccaag catataaaca attgctttgg tttccttttg aaaaagcatc tacttgcttc 3900 

agttgggaag gtgcccattc cactctgcct ttgtcacaga gcagggtgct attgtgaggc 3960 

catctctgag cagtggactc aaaagcattt tcaggcatgt cagagaaggg aggactcact 4020 

agaattagca aacaaaacca ccctgacatc ctccttcagg aacacgggga gcagaggcca 4080 
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aagcactaag gggagggcgc atacccgaga cgattgtatg aagaaaatat ggaggaactg 4140 
ttacatgttc ggtactaagt cattttcagg ggattgaaag actattgctg gatttcatga 4200 
tgctgactgg cgttagctga ttaacccatg taaataggca cttaaataga agcaggaaag 4260 
ggagacaaag actggcttct ggacttcctc cctgatcccc acccttactc atcaccttgc 4320 
agtggccaga attagggaat cagaatcaaa ccagtgtaag gcagtgctgg ctgccattgc 4380 
ctggtcacat tgaaattggt ggcttcattc tagatgtagc ttgtgcagat gtagcaggaa 44 40 
aataggaaaa cctaccatct cagtgagcac cagctgcctc ccaaaggagg ggcagccgtg 4 500 
cttatatttt tatggttaca atggcacaaa attattatca acctaactaa aacattcctt 4560 
ttctcttttt tccgtaatta ctaggtagtt ttctaattct ctcttttgga agtatgattt 4 620 
ttttaaagtc tttacgatgt aaaatattta ttttttactt attctggaag atctggctga 4 680 
aggattattc atggaacagg aagaagcgta aagactatcc atgtcatctt tgttgagagt 4740 
cttcgtgact gtaagattgt aaatacagat tatttattaa ctctgttctg cctggaaatt 4800 
taggcttcat acggaaagtg tttgagagca agtagttgac atttatcagc aaatctcttg 4 8 60 
caagaacagc acaaggaaaa tcagtctaat aagctgctct gccccttgtg ctcagagtgg 4 920 
atgttatggg attccttttt tctctgtttt atcttttcaa gtggaattag ttggttatcc 4 980 
atttgcaaat gttttaaatt gcaaagaaag ccatgaggtc ttcaatactg ttttacccca 5040 
tcccttgtgc atatttccag ggagaaggaa agcatataca cttttttctt tcatttttcc 5100 
aaaagagaaa aaaatgacaa aaggtgaaac ttacatacaa atattacctc atttgttgtg 5160 
tgactgagta aagaattttt ggatcaagcg gaaagagttt aagtgtctaa caaacttaaa 5220 
gctactgtag tacctaaaaa gtcagtgttg tacatagcat aaaaactctg cagagaagta 5280 
ttcccaataa ggaaatagca ttgaaatgtt aaatacaatt tctgaaagtt atgttttttt 534 0 
tctatcatct ggtataccat tgctttattt ttataaatta ttttctcatt gccattggaa 5400 
tagaatattc agattgtgta gatatgctat ttaaataatt tatcaggaaa tactgcctgt 54 60 
agagttagta tttctatttt tatataatgt ttgcacactg aattgaagaa ttgttggttt 5520 
tttctttttt ttgttttttt tttttttttt tttttttttg cttttgacct cccattttta 5580 
ctatttgcca ataccttttt ctaggaatgt gctttttttt gtacacattt ttatccattt 5640 



WO 99/37660 PCT/US99/01313 

-46- 

tacattctaa agcagtgtaa gttgtatatt actgtttctt atgtacaagg aacaacaata 5700 
aatcatatgg aaatttatat tt 5722 

<210> 23 

<211> 42521 

<212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 23 

gatcgttttc cagacatttt tgttctctgt tcatttcctt atcgtattca aaaagttatc 60 

acaaatgacc ttctctatct gtctgcgtct cttttaactc tcaccgtttg ggacctttca 120 

aatagttttt cgctatcaaa tctaaacatt agttgcgttg actcgacatt tgacccctca 180 

ctatcatctc cagttctctt ttttgttaca ctttagcagt ggcagcagag agcaagtagg 240 

tggagccaaa gtgtgcgcca ttcatcggga aaaattgtgt tcttcatcaa attttgggca 300 

atttactcgg gatttgcgct aatttggaaa caaaaattca aattcctgcc aattgttttg 360 

tttgcttttt ttcttttttt tttgctcctc ccatctctca tcaaattgct cttttttcga 420 

ttctaacata tcagccatct tcagagtgtg tcactaaccc ccatttttat tcaaggttag 480 

tgatatagta tcctaactac agacgtcaca ccatgaggtt gctgctcttc tcggcagccc 54 0 

ttcttctgtg ctccgtccca acgtgggcct tctctctgtc atcattcttc ggaagcgatg 600 

ttgcacaagt aagcaagctc tcctatacct agaatcttgt aaattgaaaa ctctaatttc 660 

cagaagccat accttcatcc aaactcccca ccggagcgtg acccggcgag ttccagaatg 720 

aagagacagg catatcaagt gtacgttgat ggagatgttt ccgttactgt tgacaagtct 780 

ggacaaaagg aaaccggcaa ctggggacca tgggtgcccg agaacgagtg ctcacgttcg 84 0 

tgtggtggag gagttcaact cgagaagaga cagtgcaggt tcgtggactt ttcatttttt 900 

agggaatttc ctagacgttc taaaagctta ttttcaaaaa ttt tggtttc* ctgatcttca 960 

tgcctttatg aacgtggtga aagatcaacc taggctagcc tgtgacatac attttttgaa 1020 

gcagatccaa ctttatcaag agccatcgaa ttctcgtttt aaagtgtttt ttttttctga 1080 

taactttttt ctaatagctt tacccatttt tatgtcaaga ctgaaagcaa tgaatcacaa 1140 
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gaggctatct acgtttgttt ttgaagctct 
taatggagat gaaattctaa ttttttaaaa 
agttcaaact tttcaaacgg ttattctcaa 
ctatctattt cttgcaagca acccaccgaa 
ggtgactgca ctggagcttc agtccgctac 
ggtactgatt tccgtgctga gcaatgctcc 
taccacaagt ggactccata caagggaaag 
tttctaattt tcgagttttc aggtgcgagc 
actacaagtg ggctgataag gttgttgatg 
tctgtgttga tggggaatgt cttccagttg 
ttaaaattta attcaaaatc ttcatttcat 
gtgcggaaag tgcgatggag atggttctac 
gcgcaatctc tctccaggat accatgatat 
taagattcag gaagccagaa agagcaccaa 
cttttatttg aatggaaatg gattgatcca 
tatcttcgtt tacgatgacg ctgaaccaga 
ggagctcacc gttgctcttc tcttcagaaa 
gttctctatt ccacttgagg aggaagttga 
gtgctctgta tcatgcggaa agggtgttca 
gaacaaggga cgcgttgagg atgatctctg 
cgaaaagagc tgtgaaactg ttgactgtga 
ttgctcatcc acctgcggag atcaaggaca 
attcgctaac ggacgtcgtg ttaccgttga 
agtaaagcag acttgcaatc ggtaagttga 
ctatttgttt atgcgatgct atccatatat 
ataaacaatt attttctctt tgcagttttg 
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gtaggaatca tcttaaaaaa ttaagtaaag 1200 
tcataatcat tactttctgt attatcttca 1260 
gaaactcaca tagaatttta acaatttcct 1320 
ctcaaatctt atccaaacta aacttttagt 1380 
atctcgtgta acttgaacgc atgcgagtct 14 40 
aaattcaacg atgaggctct tgatggaaac 1500 
aacaagtaag ttaactttct tcaagatgtt 1560 
tcgtctgtaa gccagaatct ggaaacttct 1620 
gaaccaagtg cgactccaag agcaacgata 1680 
gatgtgacgg aaagcttgga tcttgtaagt 17 40 
gccgaatatt tcagctctca aattcgacaa 1800 
ctgcaagact attgaaggac gtttcgatga 18 60 
tatcaaactt ccagaaggag ccaccaacat 1920 
caacttggct ctgaagaacg gttccgatca 1980 
agttgagaag gaggttgaag tcggaggaac 2040 
aactctcagt gctcaaggac cactctccga 2100 
gggaagccgt gatactgcta . tcaagtacga 2160 
ctacatgtac aagtttgaca actggactcc 2220 
aacccgtaat ctctactgta ttgatggaaa 2280 
cgaggagaac aatgccacaa agccagagtt 2340 
agccgaatgg ttcactggag actgggaatc 2400 
gcaataccgt gtcgtctact gccatcaagt 24 60 
ggatggaaac tgcaccgttg agagaccacc 2520 
ttttataaat gcataaacaa ctctgtgaat 2580 
attaccagat ggtgttggtg cccaaaactt 264 0 
cctgcccaga gtggcaagct ggtccgtggt 2700 
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cggcttgctc agagaagtgt ggagacgcct tccaatacag atcggtgacc tgccgcagtg 2760 
agaaggaagg agaagaggga aaactcttgg ccgctgatgc ttgcccagct gatgagcaag 2820 
agaagttcga cacagagaga acttgcaatt tgggaccatg cgagggactt acatttgtca 2880 
ctggagaatg gaacttggtt agattttgca aaatatgggg acctggggaa aagcatacta 2940 
aataagatca actttatgaa acaaataatt tttagtgcac ccgctgcaac gatactgagg 3000 
agactcgtga agtcacctgc aaggactccc aaggaagagc ctatccactc gagaagtgtt 3060 
tggttgataa ctccaccgag attccaactg atactaggtg agtcattcca gatatgacat 3120 
tgaacttgga ttaatttttt tcttccagat catgcgccac ccaaccacca tgtgagtacg 3180 
agtggaccgt cagtgagtgg agcaagtgta ccaccgaatg cggacacgga cacaagactc 3240 
gtcgtgttat ctgtgccatc caccaaaacg gaggactcga ggttgttgat gaaggacact 3300 
gtcaagctga gaagccagaa ggaaagacta actgcaccaa tgaggagaag tgtactggaa 3360 
catggtacac atcttcatgg tccgagtgta ccgctgaatg tggtggtgga tcccaagatc 3420 
gtgtcgctgt ttgcttgaac tacgataaga agccagttcc agaatggtgc gacgaagccg 34 80 
tcaagccatc tgagaaacaa gattgtaacg ttgatgactg cccaacttgc gttgactctg 3540 
agttcggatg ctgcccagat aactctactt ttgctaccgg agaattcaac ttcggatgct 3600 
ctaactgctc ggaaacagaa ttcggatgct gtgctgacaa tgttaccgtt gccactggac 3660 
ctaactccaa gggatgcgaa gaattcgttg agtctccact taaccttgaa gctgatgttg 3720 
ccaatgctga cgctgaagct tcaggagatg ctccagaact ctgcagcgtc acaaacgaga 37 80 
acggagaagc tgttgatgtt gagtgtgcca ccattgctcc aatcactgct cttcttggag 38 4 0 
atggggaact tatcggaaat gatactgatg cttccaatga gaccatacac tgctcgaaga 3900 
ccgaattcgg atgctgtcca gattggtaca ccgccgcctc tggaaagggt aacgaaggat 39 60 
gcccatcgtt cactcttgga ggatgtaacg agactcaatt cggatgttgt cacgatgatg 4020 
tcactcttgc tcgtggagcc aaccttgaag gatgcggaga gccatcttgc gctgcttccc 4080 
tctatggatg ctgtaaagat cgtaagacaa ttgccttcgg accacactat tctggatgtg 4140 
agcgatcatc cttcccatgt gagcttagcg acttcggatg ctgcccagat ggtgagactg 4200 
ctgctcttgg aaagaatgga accggatgcg gagagaactg cttgaccacc aagttcggat 4260 
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gctgccctga tggaaagacc accgccaagg ggtcccacaa cgagggatgc ggatgcgagt 4320 
tcgcccaata cggatgctgc ccagacggaa aatcagttgc caagggagcc ggattttacg 4380 
gatgcccaga aagctgcgcc cagagccagt tcggatgctg cccagacgga aagactcgtg 4 4 40 
ctcgcggaga gaacaaggaa ggatgtccat gccagtacac ccgttacgga tgctgcccag 4500 
atggggagac tactgctctt ggaccacgca atgatggatg tgataactgc cgctacgcca 4560 
agcacggatg ttgcccagat ggagagacca aggctcttgg accagatgga gccggatgcc 4 620 
caccaactac cacgccacca ttcctcatgg gaggaactgt tgccccacat aaaatcgccg 4 680 
cctgtaatca gacacaagaa agtggaaccg tctgcggagc cggatacaag cttgtaagta 4740 
attaacctca tgaaaaagaa ttggagcaac acatttcatg tataaatatt tcaatttcag 4800 
gcatggcatt atgataccac tgagggacgt tgcaaccagt tctggtacgg aggatgcggt 4860 
ggaaatgaca acaactttgc tagccaggat atgtgcgaga ctatctgcgt cgaaccacca 4920 
ggcaagggaa gatgttacct gccacgtgtt gatggaccac tccggtgtga ccaacttcag 4 980 
ccaagatact attatgatca ttccaagaag cactgtgtgg ccttctggtg gagaggatgt 5040 
ctcggaaatg ccaacaactt caactctttc gaagaatgct ccatgttctg taaggacgtt 5100 
ggaccgtacg atgctccaac caccgctgct ccaccaccac caccacagca aaatgctcag 5160 
caataccttc caactccaga agttcaacag attgagattc aatctgctga gcaacctcaa 5220 
ccacaacagc cacaacaaca gcaacagcaa caacagcaac aaccacagca accacgtcaa 5280 
tcaatggaag acatctgcag atcccgccaa gacgccggac catgcgagac ttactccgat 534 0 
caatggttct acaacgcttt cagcqaagaa tgcgaaacct tcacttatgg aggatgtgga 54 00 
ggaaatctca atcgtttccg cagcaaggat gaatgcgagc agcgttgttzt cttcgttcac 54 60 
ggagctcagc catccgctgc ccggcaggaa caagctcagc cagcagctca accagctcaa 5520 
ccagctcagc caagtaacat cgtctctcca ccacaacagt cagctagtcc agttgtggtt 5580 
ccatgtaagt tctttagaat gcatttattt cttactataa gtttctataa gttcgcatgt 5640 
gaagcatccc catttcagcg aacagcaaac aacgcgatgc ttgccacctc aacgttgacc 5700 
aaggacgttg taagggggct tttgactcct ggtactacga agttgccacc ggatcctgcg 5760 
tcacattcaa gtacaccgga tgcggaggaa acgccaacag atttgctagc aaggatcagt 5820 



WO 99/37660 

gcgagtcact ctgtgtgaag ccagcttctg 
gagttataga gaatgttcac tatttttgtt 
gtttgaaaat gtttgcacca tgtttcaaaa 
aaaatcttaa ttacactcca gaagtacatt 
aaaaaggttt aatattgagg tttcaaatta 
aaactgattt ttaaaatctt atttttgaag 
ccaacttttt cagacggtgc agctggaatc 
ggaccgtgca ccaactttgt cacgaaatgg 
cgattccatt acggtggatg ccaaggaacc 
aaggctgctt gtcaaaatca taaggatgct 
tctggaaagc attcctatta ttactacaac 
tatggtggct gcctcggaaa tactaacaga 
tgcccgagta agttctaagt taatagtgat 
aattttcaaa tactttttgc ataattacct 
ctccacccgc tacaaattgt ttcccgtact 
acacggcaat gctgcctaaa tgaactgcct 
aaaaaaaaca attctctcaa aaaattccct 
gttgcgctat tttttttaaa taaatgaata 
gcatgtaagt ttgtagtgta tcagtgtgtt 
tttcctgaac tgcatgaaac tgttcttatt 
gtgtgatttt gtttcagaag acgaccagac 
aagtttgcca cttgttcaag aagatcctca 
acgtgtagtc caagtgccta cttctcgtat 
tattaaggaa tcagtagcat gtaaattgtg 
ttctcactca caatcagata tggagtagct 
tgtcataaca tccaaattta attattaaaa 
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aagctgcttc agccggaatt ggtatgcttt 5880 
aaatgtttga gtaaatgaga aactggctca 5940 
tagtttttga gttgaatagt tgaggccatg 6000 
ttaaaacatt tttgagaatt aggtcttcaa 6060 
gaaatattaa tatacgggga tttgggttta 6120 
tttcgctttg atattcgtgc aaaaaaaaaa 6180 
aactcagttt gtgacgaagc caaggacacc 6240 
tactacaaca aagccgacgg aacctgcaac 6300 
aacaatcgat tcgacaacga gcaacagtgc 6360 
tgtcaacttc caaaggttca aggaccatgc 6420 
actgccagtc atcaatgcga gacgttcact 64 80 
ttcgctacca ttgaggagtg tcaagcgaga 6540 
atatgctttg tttccccttt attctttgac 6600 
tatttctatt cccttctgtt tcccattttc 6660 
ctctcctttc tcactttccc gtccgaaggg 6720 
aataatattt atgaattttc caattttcta 6780 
gccgttccgc cactgctttc ttcacccatt 6840 
aagctgaaat agttaacagt ttctgaaatt 6900 
tgtcgtgaaa gttttttttt acctgcatga 6960 
acgttttaga tttgctgaag tgtgctagaa 7020 
tacaacaaca tcacaaccag aagagctccc 7080 
gccacgaccg gcattttcat tgaagtaagc 7140 
gaccaaaaaa tttaatataa ggtttccaag 7200 
tggattgttc tcctgggttg atgggttttt 7260 
tatatgggaa tttatttgag aaatagaata 7320 
agttgtgaag tttctcatta tgtatataaa 7380 
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attcgccttt caaataagaa caaaaattaa 
ttgagaaaat aactggttca aaaagaagaa 
ggattttctt ttcaggtcgg ggaaatttcg 
aaatatacac caaaaattat ttttaccata 
agtattcatg taaggttact gtattgttac 
agttgtaaga agtatagaga ttactgtaga 
aataatctgt ggggatagaa tgttgaaggc 
ttttaacagt gattttttta agcatatcct 
ttcccactaa actctttaag acaatccttc 
gcacgttccg tatccgcccg tcaccatact 
tatgctgttc cagatccagg atcttgcggg 
taccgtctta atgattcaga gactaccgtc 
catgccgtca attctactat ggtggatgtg 
ataaatgtga aacatcgtgt gttgctaaga 
cttcaaaatc tctggaagag gttagactaa 
atcatgatcc agaagttgat caaatcgaag 
gagctctacc tgaattatgc atgcttccag 
tgagatggag gtaagtcaaa tcaagaatag 
tatactaaaa gcaaaatctt aaaatctttc 
accttcatgt attctggatg taatccaaat 
attattttgt gtcctttcgt ggaactggcc 
gaatacccaa tttgagcacc gctggctcac 
cggcctctgt attttctgag cactcttgag 
ttcaagaggg agtgagagag agaaataaac 
tagaagtcga tgccgttctg cagtatcctt 
ttatggatta ggaactaaga aatcatcact 
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ctgtatgaaa gagctgaatt caatttgaaa 7440 
aaacgttgga aaatctagac gtaaatctat 7500 
acgattttta tattttcaaa aatcattcac 7560 
ataaaatacg gaatttcact ggattactgt 7620 
tctagggata ctacaagaat atttttgcaa 7680 
ttgaaaatct agacaaaaat cattttccgt 7740 
acaaggctta taaagcacca tgggaaaaaa 7800 
ctttcccagg aaatccactt ttcaaatata 7860 
gcccatagtc gtcgccgtga tgctccattt 7920 
cctgattccg aagaggaacg agttgactgt 7 980 
taataaatct cacctatcca ttacaaccat 804 0 
ttgtttggca ctactctgcc acgagtaact 8100 
ctgggaatac gaatcgcttc gagacccggg 8160 
ttgaagaacg cgtggaaagt gtgtcagaag 8220 
cggatccaag gatggattct cactttggat 8280 
aagaagctga atatgtcatt gttgataccg 8340 
aacaaagagg gtcttgttat gataacattt 8400 
aaaattcgaa aatccgaaaa actttataat 84 60 
agattcgact ctgaaaagtc tcaatgtgta 8520 
gcaaatcact tcactagtca ggttagtttc 8580 
ccttggtttc taacttgatc ttctccttcc 8640 
tttttcgacg gtgacgttcc tcaattctag 8700 
caacagtttc ctcactggaa atgtttgttt 8760 
gtacaatttt tgaagccgca catgatttgt 8820 
catgtttcgt agttgtttct gtagtaattt 8880 
cactgcggta gttgcatttt tgtgcatgca 8940 
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tcttcccata aaagcaacaa atgcaacaac 
tcgaagtcga tttctaattc ctttctttac 
gtgcttcttc ttgctggggt cgagctcgca 
ttttttgcat tgttcacaac ggcgtccagt 
gcggtttttg aaatctgaaa attttaaatt 
tcacagtcct atgtctatat taaaaaaaat 
tttttcagta tttctataaa aactgcattc 
acttagtctt taagtacctg gggatccgta 
ttcaaatccc acatcacaga tgcatcttcc 
tgtatattgg cattcacttc caaaatatga 
ttctcctttg tttccgcact gaaataattc 
atataccttt tccgctggcc gatagtcaca 
gcacggctca ctgaatagaa ggctttccgg 
aactgtaaat tgaaaatttg taattccaaa 
aagaatttta gagttagaca ttatttttct 
ataaaatatt taataggatt gtagagcttg 
aaaggtatcc aaaaaggtat tgagatagtt 
ttttaagttc tacaaatctc ttataacatc 
accataccac attcaaaatg ttcacattta 
atccctccag catagattcc tccattacga 
ttgtctagaa tacttctctt ttgaagaata 
agatctagtg agtcatctcg tgattgtgct 
gtggaaaatt gtaaacagta catagcgtta 
atttattgcg tcttgaaagg ggtatcaatg 
tataagagaa tacaaaagat gacgtaattg 
aagtatcaat tattatctgt gaagacagtc 
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tgatagagcc gccacacaaa ttgcaataat 9000 
tttttgtcta tgctcagctg ctttttcgat 9060 
atgaggaaat ggttcgatga gtggaccgtg 9120 
gtatttgtct gggcagtcac acgaaagagt 9180 
taagaacagg atctatagca gttttgccca 9240 
tatcggacat taaaaaaaat gttttctcat 9300 
gcatttaatc ataactttta atcgttaaaa 9360 
aacacagaca atttcatcac aataatcgcc 9420 
atttctcaaa aacccctcga cacatttact 9480 
gcccacacat tcacatcggt cccccttcca 954 0 
aatagatttt ggaagtttag ggcctcaaaa 9600 
catttcacct ttccatccga cttcgcaaat 9660 
gtcgaagcgg aatccgaccg agagtccgtg 9720 
aaaaaaacag cttttgcaaa aatcgtccaa 9780 
caaaaagttc aaagttgtat cagttttaaa 9840 
ttagaaaaaa taaaagctac ttgaaaaaag 9900 
tcaagcaact ctatttgtaa actgtcgagt 9960 
gctacatcta ctatcaaact ttgaaaaaaa 10020 
tctccagtct gtcccttgat acaatgacaa 10080 
cattcggctc tcggatcatc cagagcaaca 10140 
cgatgcacgt cgctcaatat attttcatct 10200 
tttgttgttg ataaaaatag gaagagtaaa 10260 
gatactgaca agtctactat caattgattt 10320 
agagaaatag ggagatgggt aaaatgcatt 10380 
attaatcaga gatcagttga aaatactttt 10440 
acgtgactct gactcgaact caatttgcat 10500 
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gttgatagtt ccaatgttaa agaaagtctt tgggttttct ccagatgaaa caaatgattt 10560 

tggaatatta aacgtgactc ttctctgaca aggtttgagt ccgtcatcac aatcgtgata 10620 

gatattaagt tttggatcaa tagtcatcac ttcggaagtg tgtccggtaa gaaggaattg 10680 

accaagagag tctgtagttc cttcggcaag aagatcgtca agatccggtc ctgaaaaaaa 10740 

cttttatttt gaaaaatttc aatgagttgc ttcatgttag aatttggaat ttttaaagat 10800 

gttagcaatt ggtatttaaa tgttcaagct aacgtaatta gagttattca aacaagcttt 10860 

atataaaaac tttgtgtaag attcggtcta attagaacat caatttttaa cgcagctgat 10920 

aaaaaacttt aatttcaagc ttcacataat tctacttacc ggtatcatca tcgtagagct 10980 

tcaccttcgt gttagccagt ggtttgtctc cacacatcag aacaccctta actccagctg 11040 

attgggtgaa tacagcttcg gagccaattg cacaaagtat gaaaagtgat gaaatgcacg 11100 

cgagtcgtga cattattttt gtctgaaaat acaaacactg actgatctga ccttcatcgg 11160 

agaaactctc ttatagcaca gttggttaga aaaagatacg gagaggagaa gtgggaaatc 11220 

gaattgacca aacaaaagaa ctggttttca cttgaaatag aagacgatga aagatataca 11280 

acagagaaga tcggaagtga ttcatctgga gaagaaaatt gagaggagca acttcttgta 11340 

ttttccactt atttatatac ccaatagaat tcacctgatt ctttccgatt tgtgtacatt 11400 

tcgctgacta acgtgtgctt cttcggtttt gtcatttctt attgttcatt gaaaataaac 114 60 

agaacaaagc aatcataagg tcgaaaatcc catttagaga tcaagaggtg tacctttaat 11520 

tgtgcggcat ggcatagttt tatcttgctg aactctcacc aattgatgag tatgtcagta 11580 

gaatggattc catccgatcg ttgctccacg gtgatctctt ccgccgcctt ttcatccacc 11640 

atacccgttg tgtatggctg gcaactgtga acagcgcctc agtggaatgt ttagtttgat 11700 

atacagttta aaataatttt ctaaactaaa gaaatcagtt tttgaaacca gtcttgtagg 11760 

catgtcgggc gcaggcacgc taacgtgaaa aatagaattt cgagtggtta actattttat 11820 

tttcaattaa aatacaatca actacacaat gaatgacccg gataaatgaa atacaaatac 11880 

aagaatttaa aaaaaacatg gaaatttaaa cttttccatc atctcccttt gctggaatat 11940 

tatatttcat tcgataagct tccaattcgg cttttctctg atcggatcgt acactgtgtc 12000 

tctcatccat ctcttgttga gctgtcattc tcttctcatt ccatttctga gcttttgctt 12060 
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ttttgtggat 
gaattacagc 
cccagcagac 
gcatacatct 
aattttcaat 
tcaacttctg 
agtttagaac 
gtttacgcgt 
attaagaaac 
acgcaagttt 
ccaacacatg 
tctgaccatg 
aatcggtgaa 
actgagcgac 
ctcgcgtgtg 
ggcagaagaa 
aaagaaaatg 
gaatggtaaa 
tgaaagataa 
gtttttgtat 
tttagagtgg 
aaataaagaa 
tgaattacat 
ttcaaaaacg 
ttcagctttt 
caaaagtgga 



tctgttgtat 
aatgattcca 
attgtatttg 
tttcgttttt 
atgtttgctt 
aaattaggaa 
tacaggtgtg 
caatttattc 
ttgtttcaga 
ctttgaaaac 
tttcacaagt 
tcatgttatc 
tcatcgttct 
aaaaccaaag 
tttgtctatc 
gaacatacga 
catttgagaa 
aataaaggtg 
caaaaaaa'aa 
agtgcaaatt 
gtcttgtcac 
actaggcgcc 
taaattccaa 
tttttaaaat 
ctctctctct 
agtgaatgag 



tctttgcact 
gcgcagatgg 
acatgcttga 
ccaccacacc 
ataaaattga 
agtattctca 
aaaaaatctg 
gagtgttcat 
catttttgct 
aaaaacaaaa 
gttcaaatcg 
cgcacatact 
gaaaaatcaa 
tagtaatgaa 
gagtgccaat 
actatgtcgg 
aatgcaaatg 
gatcagcaaa 
taaatacaga 
cagtagttgt 
gatccatatt 
aagaagctat 
caaaatagga 
tgttcataag 
ctattctcta 
gatggaaata 
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tgcagcagca 
caataacgat 
tattacagtc 
agaagcaatt 
atttaatttt 
attgagagct 
agcaagtgaa 
tgtagagaaa 
cttttagagg 
tatatttttt 
ttcgatccaa 
gatactagaa 
taaatagtaa 
atagaaagat 
gagttttagg 
gctacaagat 
ttcggcagaa 
aatagttgaa 
aaacaatata 
acctgttata 
ttttgaacgc 
agtagctata 
aaatcatata 
cgccgagctg 
tcgtcaccta 
tgataccgca 



ataacagaag 
tgcggcggct 
tggatagtac 
ctgaaaaaat 
tcaaacagtg 
gtttttgtat 
caccaacgta 
gttaggtcac 
aatttttttt 
atccacttac 
ttctacagta 
caattgagaa 
caacttgagc 
agaaaggtaa 
agtagcgaca 
tcttgtgttt 
atcgaatgga 
caaatatttt 
tgacgtattt 
agtacagcga 
aatatttgaa 
cgcataaatt 
aaaacgaagt 
tccccctcag 
tatttcatag 
tgcttcaaaa 



caggcaatga 

gatgtgctca 

cagtcgaatg 

gtgtttttga 

tttcagaaac 

taaaagtttc 

ttgcatcaca 

cttccagaaa 

tagaggaaac 

cgagcccttg 

ttcttgtttc 

aaagagtagt 

aagtctcgta 

actcaaaggg 

gaaataagtt 

actttttgaa 

gtttagagca 

gtagatttca 

ttcaatcatt 

agttatacat 

atccaaaaaa 

gtgaatacct 

tagttgtcaa 

ttttcgttta 

tccccttatc 

aaatttgctt 
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atgagaaacc aacatttgaa aatttccagg aaacttgtga acgagcctgt ggtaaatgga 13680 

gaaatgtggc agtgtgcgag ttgccggccg aacacggaga ttgccaactt gcgattccca 13740 

ggtatgtact gttgacacat tttacaaatg ggatgggaag tggtcggtga tcaggtggaa 13800 

atgttgatgg caaggtttta aatagatgta gtaactgaaa acaaaatgac agatgtacat 13860 

acataaatta ggattaaaac aaaaatacta tgcggagtca ggtgactaat ttttctggaa 13920 

attccagaat ttgaaaatgt ttttctctgt ttgaaagtag aacgggacct tttacaaaat 13980 

aggctgaggt aggtaggctg tagaaagtgc ctttggtgtc tttgtaattt ttgttttcaa 1404 0 

aaaatcactt gtaagcacat gaaaatcaca tgaataatga tgtaaaattt agaaaattag 14100 

tataaagaag atttacattt taataataat aattccagat ggtaccatga cccaaaaaca 14160 

tcccaatgtc aaatgatgat gtggactgga tgcggaggaa atggaaacgc gttctcttca 14220 

aaagcagact gtgaatctct ttgccgagtt gagacattat ggtccaacaa cactgacttc 14280 

tgtacattgg aacgatcggc cggtccatgt: acagattcta tttcaatgtg gtatttcgat 14340 

tcaactcatc tcgattgtaa gccattcact tatggaggtt gccgtggaaa tcagaatcga 144 00 

ttcgttagca aagagcaatg tcagcagagc tgccgtcctg gagacacaaa atctgaggat 144 60 

atctgcacac tccgcccaga gccgggaccg tgtcggctgg gactcgagaa atacttttac 14520 

gacccggtga tccaatcctg tcatatgttc cattatggag gttgtgaggg aaatgcaaac 14580 

cggttcgatt cagagttgga ctgcttccga cgatgctcga gtgtcaaggt tgaagcaagt 14 64 0 

gaaagcgaga gagtgggaca gctgacgtct gcatccacgc cagttattta tattgttaac 14700 

aaaacagcga tttttgttgg aaatactgta agttattaat tttaattcga agatttctta 14760 

atatttaaac tggtcccatg agagtttggt tcattttccg acaatagact gcaaaattga 14820 

taacttttca tgaacacttt agccgatttt agctagtttt gtttattaaa atttggtaat 14880 

tcaaaataaa aaccttacgc cactccactt ttgaatactt gtcaaataca ttttttcagt 14940 

tccgaatccg atgcaacagt tacggagtgc ttccaataac atggtacaag aacggaggtc 15000 

tcctccagtt cggctcgcga atcactgaag agaatgatga cactttggaa attgtggatg 15060 

ctttaactgc tgacgccggt gtctacactt gcattgccgg ccaggatagt acaatgagcg 15120 

agggagtcga ggttgtgatc aagagacttc ctggtcacag aactacatct cgtccaatgc 15180 



WO 99/37660 

tgacaccatc caagaacttc tccttgggaa 
ctacaacacc cttccgaatc tatacgcctg 
gcccgacaag caattcctgt atggatgtgg 
agaacggttt gtgcgggaag aagcgatatg 
ttcataattt taaattttaa gtttggattt 
tgttttctcc ttcataatat ctccattgcg 
ttcccctcag acaattggct aattactcgt 
cttcttccta tactttggta tacataatca 
aaaaaacaat ccacttgaaa tccattcagg 
gtgccattac ttccacttat tttgcctatt 
gtctaactct gctgattgca agcttcccag 
tacgccttcc tagtgggttc acgtttcctg 
cgtatattcc agcgacggta aattttcgct 
attttctagg acaaaaattt aaagtaggct 
caacaggcta gcttttggag agaaatcaaa 
aagcgtccga aagtttttgg gaatccgcta 
aggtttattt tgcactatta aggcgtgtag 
agccttacaa tcaagtatta ttcatgaagg 
attacttata caaaaacatg aaaaatttca 
agctccacca cgctatttaa ctggagaaca 
atctgatttt ctaaatttaa cttcaagctc 
ctgagtacaa gcaagcgatg gccccacaac 
ctcctgcacc aatggtccca ccggttactg 
tgactacgat tgcatctgtt ggaccactta 
gaaagtttgt gattttagtt aattgatctt 
gaggaattgc tcaaaacttg aacgacaggt 
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ccccaccgac accatctcca tctacagttt 15240 
gatctgctcc atctgatgct cgtgtaagcc 15300 
gtaacgcgag cacgtgcgat ttgatcgtga 15360 
gaacattctg ctgtcacact tgcacccggg 15420 
tttgatttca aattttcatt aatcttttaa 15480 
agatctcttt ttcccttctc ttcctatact 15540 
tcgttccagt aaataaatat gaatttattt 15600 
tggcatgaaa tacaagacaa aaaaaacaag 15660 
tgtgaactaa catcttactc tattaacttc 15720 
cactaatgaa gtctctgaga attattttct 15780 
ctcagcggag ccgccgaaaa cagaaatttg 1584 0 
cggatgcggc gagtaatttt caaagagatg 15900 
ttttgttaaa tgaatttcag gcttcaaatt 15960 
tgcgcatact catttccctg ccttacctgc 16020 
agtttggtgt ctgtaaatct aagctttccg 16080 
tacactttaa gattgataaa tatttgaatc 16140 
gcactaggcc ggcaaagctc gcctacgggg 16200 
tcttgatttg gttacagaat tccatctaaa 16260 
gtttgccccg ccatctgaga agattcttca 16320 
caatccagct tatggtaggc ccaatttttt 16380 
acaataccga tgtgcaagga atgaactacg 16440 
cacatccagt cgatgcttat tctccaccac 16500 
tagttgaacc acctgcaatg ccgtatgaaa 16560 
ctactcccgc atcagtcggc ttgaagaagg 16620 
tcaagtaatt ggatacaatt tccagcatcg 16680 
acaccagctt aacaccagaa gctcaacgtg 16740 
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ctcagaaagg tcatacctat acggctctgg gcggtggaca attctatcaa agtttacttg 16800 
gaggggtaag atgcaaggtt agaacttaca aactcaattc attttacaga aaggaggccc 16860 
cggaggattc tccccactct cgttctttct aaacggcggt ctaggaggta ctggtggtgg 16920 
tggtaacaat ggattcttcg ttccggtgcc tgtagtcatt ccgcctccac cgccaccgcc 16980 
accaggacca aactgtttca cgaacccgtc gggattcctt tgctgtaacg tgacacttga 17040 
gaaaactatg gaagacgcgt acctggccgc aaaagcagat ggtgcatcac tgtgcaatgt 17100 
acagaaaatg gcaactgcag tgcaagcggt ggggtttatg gatttcattt tataatgtaa 17160 
tgtgctcttc cctagaattg aataagctta caacttgaat tacgacttga attacaactt 17220 
gaataagctt aaaatatcca ccaaatttca gcaagccgaa aaaaaattcg gaacaacttt 17 280 
cgaatcagtc gctgctcatt cggacttcgt cgcaaaaatt aattttgccg gtgacctgaa 17340 
ctgtaaaata gaaatcgatg ggaaattcat actagcgtac gcaactccaa tcgccgagca 17400 
agaggtgaac attgtcgatg ctagctcatt cttctcggga gctgctgata aggatttgga 17460 
tggtgtcaat ggtaccaagc ccacctacat tgtctacggt cccattaaat aatggagggt 17520 
ctagctttaa agatttctgt atattaaagc tgaaatgtga attaattgtt tatttgccaa 17580 
tcacaataaa gttggaaata tcatttgaat agttcgaaag ttttcaatcg gaatgggaga 17640 
aaattcgaaa atttaggtgg aggtgaaaag ttgatgaagt aacacaatta actgtgctcg 17700 
aatcctgaat agaaggagaa aagagcctat aaacagattt tcaatttaca catattacac 17760 
aacaattcag gaagaagaca gtagttgcaa aagaaaatac gtagaaaaaa gagtgaagga 17820 
ctggcgggat gtcagtttgg atgtacaaat agaactcctg aagcataaga aacagaagaa 17880 
tcgaccgatg atcgaacctg aaatggattt attgttgatt gaaaaatatt aagcaattct 17940 
gaatctctac cttgtttgat tgtgtgtaat gcaagaatct aaactcgtga gtgtgattgt 18000 
tactgatccg gaaatgttcg gctgcttgca gcattatcaa tatcggatta cgcccacaaa 18060 
tcgtgttctg ggtctttttg aggtagtcat taaaagctgc cggattaagc gtctcaattg 18120 
cgctcattcc ctgcttatcc atattggtta tctgctcata aatcggaata gaactatgac 18180 
gatcgtacgg agaaaagctg aagcgttctc cccaatggca aaagtccgaa gagatcacaa 18240 
acaagtttct tggatcctcc atgtaatgag caaaaatatt tccatacgtt tgctgcctag 18300 
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atcctggtaa agatccaaca agtaccggaa caatggtgta acgttttgaa cccataacct 18360 
ttgcaataaa tgggagttgc atttcaatac tatgctctga ttcttcatct cggcgatcca 18420 
tcaaatcgaa atgacgagtg gcacgaagct cctcgttaac tgcaaagggc aatgttgtaa 18480 
aagatgtact aagagtgcaa tagattactt ttgtgatcaa cgatcaagtc gccgagtgga 18540 
gttctgtact tgctgcatgt ggttatagca catccattta gagcaacaac gtgagatggg 18600 
ccaagaatga agactctttc actgaaagtt attgagtaag ccctgttgcc aagtacaaat 18660 
ttcaacaact cacactgctg atgaaacaac ttgtttgaaa gcatatgcag ctgtttctcc 18720 
acaatacgaa tatcccgcat gtctgaaagt tatcagaaaa taaatattaa atgcatttag 18780 
agtattacgg tgaaatcaac gctcgagccg ttccaatccg tggaccggcg ttgtcaagcc 18840 
attttgtgag ttgccgatca agatctcgct ggttggcgtt gtaccatgat ccggcatgtg 18 900 
aggcagatct cgtgtgctcg ccgaatccgt ttagtgacat tttaaattca gatggtctga 18960 
atattaaagt tttgataaat tgttgtatac gacttgatta atatgtttag tagggttttc 19020 
aactactgtg tgtttcccaa atagtcaaca ttgaaaaatg gaaaagtttg aatttaaata 19080 
ttcaaataat tttaattaat taatattaaa attcacaata cagtgtaaca tcacacttaa 19140 
ttcaagatgt tctaaaaata tgagccatcg ggctagctct acttcacgaa ttcgaatcaa 19200 
gtccggggaa ctggctcgaa agaaaataaa tttttaattt ggtttatgtc cgaaatagaa 19260 
atgggaatct ggtttttcat tctgaataat ttccgagaaa cacttacaaa ataaaattca 19320 
gatatcttgc aaaaggaagg ccaaatgtcc tgagaaatag agcacgagag ttttgaaata 19380- 
cctgcaacaa caggatttgc ttctattttg ttttttgaac tgaattttaa actattatct 19440 
attctgaaaa cattttttgt ccaaaaaaaa tcaagaacaa tttagagcaa aatgtggcaa 19500 
tccgaaaatg ttgatgcaac aaaaaagtgt tttttttttc attgaatttc agttttgaaa 19560 
actgatttct ttccaaaaaa aaaacgaagg aaaattttga gaaaaaagtg aaaatccaaa 19620 
aatgctgatt ttggtttttt tttcaaaaaa aaagcatttt gcaaagtgtg tgcttttttt 19680 
cgaaagtttc agaaccttga gacaaaaaao caaaattgtg ttcccgagtg aagcccgcca 19740 
cgtggacatg gtoagacgaa tcttgttcgt gttcgcagcc aattttcatt tttgctgaac 19800 
gcataattgt tcaaagaaga ttcggtctaa aaagacgaaa ttgaaataga ttgtggaatc 19860 
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rattattcaa aaattgagat ggtctcgtga 19920 
ctttgaaatt ttcttttgac aaaaggtcac cgttattca 
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atctttatac atttttcacg gggttcaaaa acaattatga agaaaattgt ttgattaaat 21480 
tagaatgtag attctttata ttttcaatca aaaattaatt ttggaaaaat aactatccaa 21540 
aaaacgaaaa aagtaataaa tgagtacttg aaagtgaaat ggggcaatta aacaagataa 21600 
aaaagactaa aacgtgagac atctcacaac gggtcacggg caagaagtac acgagaaatc 21660 
gaacgtgagt ggggaggcag agacactcag ctgactgcct ggcctgacgc tcgctcacaa 21720 
aacgctctca ctctcttcct cgctttgccc gctctccgcc ccgggtcgtc agttcggtcg 21780 
atccatgttt gttcattttt ataggtgaaa atttatgtaa gggaacggaa aatgtaaagt 21840 
gatcgtggga aaatagaaaa acaattacat tgtaactttt ctggaccaag ttgtacccag 21900 
atgcaatatg tatatttttc tcagaaaata ctgtgttggg tttcgacagg atcgatttat 21960 
caaaagcaaa cgagtgtgcg tctcaacgag cactaaagtt cccaactaga gcatccttgt 22020 
tgtggtagaa ctacatagaa atttttaatt ttgatttcaa tagcttttct cttgttttct 22080 
caaaatttat tgaaaaactt atttactata aaacgaccaa cgacggatct ggaaactaca 22140 
gtactcctta atgcaaaagg caacgaaaaa tcagccagtg acttattttt tgttctggat 22200 
aaaaatcggg aatatttgca ttttgaattc gcactgtatc gataaacaaa acaccgaaga 22260 
tcacgccaaa atgactattg taactaacag gtacgagaaa gggacgcttg ttctacaaaa 22320 
ataattcaac aaattttccc caaaaaaatg tgaagtccgc aattctcgta gttttacgta 22380 
aatcaaaccg agcatgacac tctgacacca cgtgcgcctg aagatgtgcc tgcctaccat 224 4 0 
ggatgcttta catttgctag ttccatgaca ccccatcctt tcagcttcca agatgaagga 22500 
gttcggagaa aattcgaaaa aatattgaga aaaataaccc aaaacattct gaaacattgc 22560 
ggaaaaaagt tagaaattat gtcgaatata tctgaaccaa tcaacaattt caaataaaat 22620 
acaaaaaaaa attggaagac cttaaatagt ctccgcccat attttggctt caaatgaccg 22680 
tacttcggaa tatggccgat ggccgtggca agacctccaa tcgtagtttt gagcggtcag 227 4 0 
taagtgaaga ttaaaatagg aacagtaccg taagatcagc ccaggtgfcgg atgtgggata 22800 
gaggaactga aaataatcga agaagcatga taactaagcc acgtggccac gttcgttttt 22860 
gcgatgttaa tagatcgcca cttcgtccat tgtcgttttg tttgtactaa gtctccttag 22920 
caattctctc gaaggcgggc cattgctatt agtaaaataa gctaccaatt ttacctttca 22 980 
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., M «t=.t tca« g ,t„ tttt.et.tc agg t ga tcat ttttttgtt. ttctc.stta 230.0 

«««=«. ...ta.t,,. g ttttt g ett ccgct.tt t gg tt g ..,t ,.t,.c.t.t 23!00 

vtntnctctt tttacgtctt acaacttctt tttaagtgtt 23160 
ccattgattg tcgtctccac ttgtgctctt 

, t „, 3t ttattttttg cagaaaatga gcctgttcag caaatttttc 23220 
ttgcgtattc actgtttcat ttatwi-Ly ^ 

ggaggcatga tgcaagaagc t ccgattact ccacaagaat ctattcaaaa acttcgggaa 23280 
acagaagata ttcttgagaa gaaacaagaa ttcttggaga aaaaaattga cgacgtaagt 233,0 
tggaagatca gttttggtcg aattaatcac attaaaaagt gctgaaatcg aaatttttaa 23400 
actctcgagt ctcaagtgac tgtgacgtaa ttaaaacatt gctcagcatt t.cttgttt 234 60 
actgacgtct t ttcgaagtt tagtcgagca aatccaaaaa agagcaataa aaatttctgc 23520 
t acgatacgt ttgggaaatt ggaatcatag ttttttaaac tccatttttc aaaaaataca 23580 
ttattagaaa- atcagtaagt ttcggaaatt atttgagaaa cgtttcagga aagcaaaatg 23640 
ccgtgaagta tggaacaaaa aacaagcgga tggctctcca gtgtttgagt aggaagaaag 23700 
ctttcgagaa gcagttgatc catattgacg gagt.ttggc tactctcgaa catcaggttg 23760 
gtatataaaa atattagaga aataaattga ataacacggt ttttcttcc. gagagaaacc 23820 
ctcgaaaatg cttcaacgaa tgctgaagtt ctcacggtta tgaaacttgc tagcgatgcg 23880 
ttgaaagcgg ttcataataa catggatagc gaccaagttc gtgatatgat ggataacata 2394 0 
g a t gaacaac gagaagtggc gaaggaaatc gcggatgcta tttcaaaccc tggctttaac 24000 
aacgcaattg acgaggccga tttgctgcgc gagttggtgg atcttgaaca ggttcgtcta 24060 
taccaccaac atcgtgtaat tattagaaaa tataccagga agcacttgac aaagatttgc 24120 
ttgatgcgag agctccccca gtcacgcttc cggatactcc caatattgca cttccagcct 24180 
ccagaccgag agctaaagaa gctgacaagg atctagaaga cctcgaaagt tgggcaaact 24240 
..cttctct. agtcact tt c atatttaatt ttcggctatt tttgtttcat ttgcatcccc 24300 
ttcatcaatc ctaccattct ccggagattc tcctaaatca actttctaat tacgacaaat 24360 
tC aaatagtt gaatgatttc tg«t.gcea tttcattcga aacaaatttc cccaaggo.a 24420 
cg atcaacac tcatcaaaat tgtaacat.t t a t cgagct t tttggaaatt t gtcat ttt a 24480 
t aca t ct t gg teeetttctc caaaatcttc - caagcatgca ttaaagttcc aacUttatt 24540 
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aaaaattcat tctggcaaac atgttatttg taccggttga aaacgaaaac caagcgagaa 24 600 
atagttacat ctcagatctc cctaacgatg gctcaacccc tttgacgctc atttactaat 24660 
gtttatactt ttgctcttt actaatgaat ggctcattta ctaacttgct g.g.tttttt 24720 
aatttactac tgctaattgt aagatatata tc.ttt.tc. tttactatat ataaagcgct 24780 
tattccgttt gtccatagtt tgtagtctat gtagtctttg tagtctgtga cgttttggct 24840 
tctggaagga tagtgagttg ggcttagtgt agggatatag ggggtactgt agtggtacaa 24900 
tagtggtacg gtaggagtac tgtatgatta cggtagtttc agaaaaatta gttttcagct 24960 
ccagaagtcg ggggccgcgc cggaggtgcg gtccacggct ggttttacat aaggtagttc 2S020 
caaaaaatgt cctacttcca attactcata actcagttag cgcgctatag ctatagcgtt 25080 
tgagtttaaa aaaattgtgg ccaactgaaa tgctgtttgt cagagatgcg agctctaaaa 25140 
gatgatcgaa .t.ttct.tt tctgcgg.tc tagaatattt cgatcatctt ttggagctga 25200 
catctccgca atcgctaaag ataactaaaa ggtaccaatt aacaaaatgt gttttacaat 25260 
attgccaaca acattttagg tttctttcgc tgattgtttc cttttggttt tggtgatggt 25320 
cccggagtgg tttttttcgc tggttctact attttttgga tcggcaggct ctgaacaatt 25380 
ggttgtacaa tcttcttcaa cttcatcaaa ttatccagag ttatgttgtc gcttctgctg 25440 
tccaacatat tcatgcattt gacggaactc ttcaactttc tgcattgtca ctggattctt 25500 
ctttttcgat ttt.tttt.t gaaaacttta ctatcataaa caatagtatt tatcatgtta 25560 
caaatcagtt tggaatgatc tccttcattc aaaattctta atgatcagtc gattcactct 25620 
tagagccacg aaaaatgtgg gacaattgtt tgagaagtga aaaatagtta ttaatgttgc 25680 
aattagttgt acatataagt aatacatgaa aatacatctt aaaaatacag ttactactag 25740 
gtattattgc ttaaaattgt gttccaatct gccagtacta tgagcgtaat tcgttgatcc 25800 
aatcttcgaa tagccgtgag cacaggcttc gccggcactg cacacaaact tcacgattgc 258 60 
acgatttgca gaggtagagg acgaacgact ttcctgtaat tggcgaaata ttgttttaag 25920 
ataaagttag taggaacgat cgtactgttt ttagaacgag actgtctagc tggtggccgc 25980 
atcgagcatt gatggcatcc aagaccttga acttcttcgc tgaatgatat acgatgcttg 26040 
aatatggatc cactgaaaat tgaggttata gtagattatt gggagctatt atgatttcac 26100 
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anattctcqc tatccttttc accgcttttt 26160 
ccatgaagaa ctgcgtcagt aactcgtttc agattctcg 

cgttgtaatt ctatgagaaa acggtagaat ttgstgacat ttgtcgagtt aaacaattcc 26220 
acgaggcaga caaacatctg aaatttgcgt -ttccaca aatgcataaa ctttcaataa 262S0 
aacaaaccgc ttctagggca acatcagcta aac^tc atgctcgtat tcggcgttta 263.0 
gcga gaagca taaatggtag aataaatgaa agatatcggt aggttcgcgg gaatccggat 26,00 
tgtagtcttt gagataa.ca acgoaatttt gtttc^t cgtcatcagg tatttgtcgc 26,60 
atagcctgag aactgtgcac acgttttgtt -gaaaataa atttggcatt cattgaaact 26S20 

, raataacat ccggatataa accaagagaa ttgggagaaa 26580 
acatcgatca tgaactacca tcaataacat ccgg 

tg ac, gtg . t c.ac^a,. Wt9 " tla 266,0 

lg , c ,« ttc « g «=tc=a 2 " 00 

c**.«,e t^tccc „,«c«« -,.™= ™< - 2 " 60 - 

g c« g c«, t cactcaaa. ^ ts a. a g a.a..tc g «..«-. »» 
cg t« g t«. e g ta. t c gg a te=«cta gg ««««» *«"°" 9t 26880 
, g ,. tg ac gt «-«.«« ^™ "'"= t9 " 269< ° 

tg . tg aaa« « S ».^ ™«" """"" """"" 

nctaaaca at taatatttga ttacaagttt ggaaagatct 27060 
aggtctcata gcgccgagta gctaaacaat t 

«e t ,. g c t c ,a t ca 9g .. 5 aa.aacttc* t g a,.ct tt a " 12 ° 
cgt ,t.a.ct tta »«t, C a tg aata..t ««««« <*.«-« " 18 ° 

tttt .,« a , ,«c t c tOT c — • — "»"" 9t 2,2,0 

tccc—t. »«.«**= ««««« «cc« t9 t g t =<taca gg = ,7300 
g „ g ,aa t a g «.«..«, « g ctcta tg <taaa gg c g = ..,«««.* 2 7360 

, nrtcart cctcggcccg atttgttctt cgtttgcgcg 274 20 
ttgagagacg ggtctcgcaa cgtgctcact cctcgg 

- ^rrrtc atttttttct caatttctgc tgaaataagg 27480 
gttttcaggc ctttaaaaga tagttccgtc gtttttt 

■ , aaacatttaa actcatatat tcagaatttt 27540 

tttaattaaa tttattttca aaatcttggt aaacatttaa 

e.«e«=« t=a===a g aa aa=c,.a t « caa»«aa, a t taa g ..c. catcta g ..= 27««. 
, tg caa.aaa «=««,« «««=«= «««.««. ««—— 
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aatcatgaaa tacgtccatt attgttgtat cccttgtttg catccaaagt tgactcgatt 27720 
gatctcttaa atgtggtatt ccgttcgaaa ttcgattgat ttttagaagt taacacattc 27780 
ggaatgatga taattcgtat caaaccaaaa ttgtcttctt ttcgcctttt ttgtgcagtg 27840 
tcagcattaa acaaaacgag aatattgaaa gttacgtggc gtttgcatct ctcaccacga 27 900 
tgacatcacg aaatgcagac gacaaagacc ggtgaaaaat agtgcgctga atggtgaaaa 27960 
cttgcgaaga taacgtgtta cgggttgaga gagaaaacat tccgcgagac aatgcttttg 28020 
gtgagaggcg cagatggttc agagaacact agagaaaacc gcgcctctgt ccgctcacag 28080 
ccagccccat caagcctctt cgggcatcga cgcatagaca cacatcattt tgccccaatt 28140 
tcctttcatt ccgtcaagta tttcgcaact aatcgttatt gctcattaca atacacattt 28200 
tacagaagtt cctcttcttc tacttggtcc gaccgcatca gataactggg agatccagtt 28260 
gtgcatgttc ttgtgccc.c acaaactcgc gcccatttac aattttatga tcgacaaccc 28320 
tcaagaaggt aagcatttaa acgtgttggc cgtgcgtctc aaaaaattgt taaaaaaoct 28380 
ggcgacacgc gtttttocac aatttcattc cctagggcat tttgtatttg aagtaattct 28440 
attacgcgta cgcaatcgga cgaatcctgc aggtttgttg gtagtcaatt ttatcaagtc 28500 
gactgcctct tatgctttct gaaaaaagag aatgacagtt ttcgctaagt agtactaaag 28560 
cgatctttta tctttggcaa aaccttgata taagcattat cacagcatat catgcagatt 28620 
gatttagagt taagcatgaa atgtgcaagg ctaaaataaa ttacaaaata agtccatagt 28 680 
cc.tttt.gt aacagtatac atcagctgat agaatcacat gcgtaatgac aggtctaaaa 28740 
cattatcaaa caaaagacat tacaaaaaca agaaaaatac aatataatag aacgactatt 28800 
tgaaatgagc gtagttaaat tcggaacttc aatagattat catacgcgct tttaaaaaaa 28860 
tgtgtgttcc cttttctccg cgtttgcccg ctacaaaccg gtgagtcgga aggcataatc 28920 
gggttgaaaa aaaagtatca aacactgatg gtgtcttttt tagggaggtt gtccagaaag 28980 
agaaagaaac tgaagatttg cgaatcgata gcgtcgtcat ctctcgacgc cagtgaagtc 29040 
aagatcggtt acaatagtgt atgcgattcc caaaatccac atatcaaccg gactcgtgat 29100 
atttatcatt tgtaagtact aacaagagat gtgaacgtat ttacactcaa cattagcaaa 29160 
ttccagaaga agatctaaac aaaaactatc gaaatggctc tcaacgtgaa ccgcgctgtc 29220 
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gctgatccat tctaccgcta caagatgccc aagctgtcag caaaagtcga aggcaaagga 29280 
aacggaatca aaacggtcat ttccaacatg tctgagatcg cgaaagctct cgagcgtccg 29340 
ccgatgtgta tgtttatcgc cagttggctc gccattggac acaaaaataa ccattgtttt 29400 
tcagacccca cgaagtactt tggctgtgag ctcggggctc aaacgaactt cgatgccaag 294 60 
aacgagcgtt acattgtcaa cggcgagcat gatgccaaca agctccagga tattttagat 29520 
ggtttcatta aaaagtttgt gctttgcaaa tcatgtgaaa acccggaaac tcagttggta 29580 
cgagatcatt gaattaataa tctgtctaat tttattattt cagtttgtcc gtaaaaataa 29640 
catcaagagc aagtgcaagg catgtggatg ttcgttcgac attgatctca aacataagct 29700 
g t ctacattc atcatgaaga atcctccaaa gattgatgtc gatttttgta agtatcgttt 29760 
actaacattt ttcgattgaa cttatgcaaa attctgccaa aaattotatt tgcattttaa 29820 
atcctttcaa ttcgattttc cgtgtgcttc cagtgcatac aaacatgcta atttttggtt 29880 
tccagccaaa gccgaacaaa agaatggaaa gaagacatcg ggtgctgacg ccgccgccgc 29940 
cgtggctgcc gacataatcc acaacagcga caaaggcagt tcgaatgatg acgacgacga 30000 
cgattgggaa cctgaaccag tcgagccgaa tggcatgctg tcggcgggaa tgggcaagct 30060 
cgtgctggac aaggatcttg agaagagcga agaacagcgt ctcgacatgc ttcacacatt 30120 
cttcttgaaa gccaaggaag aaggtaagaa ttctgagcat tgataaaaag tattctcgtt 30180 
atttcagata gaatttctga tgccaaggga caaactgctc tacgtgacga agctgagaga 30240 
cttgagctga agcaaaaagc .tctctcctt ctcgcgaacg tttttcttga tgagaaagta 30300 
atcactgaca aacaaatcag caaacaccgc aatcttctgc ttcgcttcac gttgaatgac 30360 
aagaaagctc aaagatacct gttgggagga gttgagcaag taattcacaa acatgaagcg 304 20 
gaacttctgt ctaaatcagc tcacatcatt aagtcattgt atgatgaaga tgtctgcgaa 30480 
gaggattcgo tt.tttc.tg gggagagaag gttagtacca aatggagctt tgtttcgaat 30540 
taaagtttat atttacagcc gtcgagtaag tatgtctcca aatcttttgc caagaagatt 30600 
attgagaact ctcaacoagt gctcaactgg ctgaaagaag cggaagaaga aaccgaagaa 30660 
gagtccgacg atgagattgc ggtaagaaat atcagatttg tttttttttt ttcaatggtt 30720 
ggttttcagt tcggaggaga cgtcaaggag agtgaattcc ttcgtcaaca gaaggagaag 30780 
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franpa aaaatcagcc aaggctacaa acggcaatgc tgctgctgca 30840 
gctgctagag aagctcagca aaaaicay^ w 

^^rrrr^i- attaatqaca tttaattgta cagatgcttt 30900 
tccggagcaa atgatgaaga ggacttggat attgatgac 

r~+=> ri-tatatttt ttgtgtattt cttcccatat tcgaaccaat 30960 
tttaaaattt acctgggcta cttatgtttt y y 

t caactaatt tcgaagaagc ctcgttttt ttttgctttc tccccctttc aatagtaagc 31020 
atcatttcat ttctgtcttc tgtcttttct gttcctacgc tgttttccct tcaccaaatc 3X080 
caattcattt attcgtaaag tcattactat ttgttgttaa tcgtaaacat ttgggaatat 3X140 
tcttgttcaa ttcagtctta tattacaaaa acacaatgtt caaaaaaaaa gaatcacttc 31200 
agatgggaac ccgtcgaatt cggcggtccg atggagaata cacattgttt tttcggaaag 31260 
ttagcccatt ttcaaatcat cacccagctg atttcatttg cgacgaagcg ataaattgta 31320 
aagagccgaa aaccttttgc tgctcggaac agtactatat g t acaataag gcttcactat 31380 
tg.tgg.ttc ....ctg.tg-_gcgcg.tte tagaagcaac ttgtccgaaa acaatgaaga 31440 
caatgtgttc taaatggtcg ttgaaaggat ggaaggattc ggtgtaagtt ttaaatcagt 31500 
ttgataataa aatatgtttt tcttttacag atgggatgag aacaaagaag aagtgatgag 3!560 
aataggatgc ttggcaaaat tccgtgcttc tcgccatctt cgttatgctc tttttctcac 31620 
aactggtagc aaactagtcg aatgtagtcc gttcgataaa atatggggaa tcggttagtt 31680 
tccaacggat cgtcttattc ttccatcgcc catcacaatg caatcagaat cttcaaactg 31740 
gaaatgtttt gaaatcattg aaatcatctt tgagctgata tggtgacgga agaaaaggac 31800 
gtctgaaaat ggctgaatta ttataggaaa agatatgcaa gccgcacaat gggctccatt 318 60 
gagctctggc aagaatctgc tgggaaagat tttggatgga atccgagagg aattgtggga 31920 
tgattcaaat tacaagttag ctctggaatc agaaaattat t.tt.t.ta. aattactatt 31980 
t cagagatga acgagaagaa gtggagaaac gaatggaaac tgaaagagat tatctattca 32040 
ctgctataga gcacatggac ttgatgtaca aagaaagagc aacaaaaaga gtattgtaag 32100 
aatcagaaaa tctgcgtaat tgtcgacaga aataacgtat tccagattgt tcgaggaaga 32160 
attgttaact gatgatagat cctacatcac accagatatt cagaggctcc ttcccgactg 32220 
ggcttggccg ccgatcctcg tgaaaaacga gcctattcaa ccatcgctgc ctgtaataat 32280 
cgatttccct aggtacttgc cttgatct.tt aatttatcag aattaacttt caaattccag 32340 



„ frt M „^ A PCT/US99/01313 
WO 99/37660 

-67- 

atcatctcca cttcgagcag ctgaaatatc acgtaggaag agcacatctc attcgacaag 32400 
cttgagtaaa aggcggtacc tcaggagcag atcgagaagt ctgtccaaaa gcccggctcg 324 60 
aagacgctcc agacatcttt cccgaagtgg atcccgtaca ccagctcaac ggcattccag 32520 
aagatccgaa agtacatctc gaagacgttc cggacggcac tctagaagtc gatctagaag 32580 
cccaccacga aaacgtccgg tacgccgatc aagaagcaga tccaggagca ggacaccaaa 32640 
ccgaaattgg acaagagcac ggagcagaac aagaagtcag gctaaaagta gcagcacttt 32700 
aacctggcca ctgagcccat cgagaagcag aagtaacagt aatgaaagga atttgaaaga 327 60 
gaagaaagac cggaaaaaga aaaaatctga gaagaaacgg aagcatcatt ctaaatccag 32820 
aaaacaccgt tctaaaagat ccgaatccag agaagaacgt cacagaagac ggaaggagaa 32880 
gaaaagagag aaaaagaaga aacgacgtcg gagaagttcc actacttcag attaaacttt 32940 
atttttgaaa" actagtcata actttaaaag tcataacttt tttaaaagtc ataacactgg 33000 - 
tttaatatca aatgtctttt caaatattct ctatttattt attcttcgta attaaactga 33060 
gattaagtac tgggtatatc attaataaaa ttacgatact ttgccgaata aatcagttat 33120 
aattacaatc tgtctgctgg tgaaaattgt acatgctatt ttcttgttcc tcattctttt 33180 
ttcattctct gtaaggtttt gttcgttttt tggaaaattc tgagagtagc cggaaaaaaa 33240 
aaaaaaaaaa actaaatacc tacagtaatg ccagaggcat atgctcaata attatcaaaa 33300 
attagttttc cgcggcgaga cccatcccca caaaagtatg actcccttga aagtcgtaaa 33360 
tgacaatttc ttgaaacaag aacatttgta tattaacgaa acacaaaatt ccgagaatgc 33420 
gtattgagca gcatatttgc cgagccaaat atctcgtagc gaaaactaca ttaattctta 33480 
aaaacactac tgtagcgctt gtgtcgattt acgggctctt tgaattatca ttgatttatc 33540 
gatagaatat ttaaaaaata aattcatttc gaaattagag cccataaatc gacacaaaca 33600 
ctacagtagc catttaaaga attactgtag ttttcgctat gagatatttt gcgcatcaaa 33660 
tatgttgcgc aatacgcatt ctcagaattg tgtcttccgt aataatagac agtggcttcg 33720 
ctaaaaacta agaacaaagt aaattaaagt ttttttctgt tcacttcaaa ttttacacga 33780 
. tcttgaagca aagttcaaaa gagcatgaat caattggaaa gtgttcaatg caccctacag 33840 
atatgatttc ggggcagtgt aaactacagg gcacagacat aaaaatttaa attgttgaag 33900 
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actaaaatat aaacatatga attcaagggt 
tattaaatgt atgcatacaa ttaaatacaa 
caacctgtcg gacaacaact ttgctgaggt 
cagctgaccg gctcagagac gatacatcgg 
cgacgcgttg gagcacaaca ctcacgatat 
ggaaactatg tggaatttag gtggatcacc 
ttggagtgca gtacatgtaa tgggcagagc 
ttgcaaaggg cctggagatt ggcttggctg 
taccctaaat ttacgaaaat tatgctaaat 
tgatctctga aaacttaaaa atatcctttt 
cagagaccaa aaacactact ttttcgacat 
tttgaaaatc tgtgttttct catacccgga 
ggcggtgagt atttctattt tatgaaagtg 
gttaattctt cgcggcaaga cccatcaatt 
atatccacgt catcatagaa tttgcactcg 
atgcaccagt ctatatgggt gttgggaaac 
agaatagaca gtagtaatag ggaagaatat 
aataaattta attgagaaaa aaaacggttg 
ggaagtgaat cttgagcacc tcgtaaatca 
agggattaag aggcgcaagc tttgccactt 
aaattggtgg acaaatagcg gaaggttagc 
acacaatgga ttttacaata ggaatattga 
tgaaggtgtc aatatttgag gtcaactgtc 
aaaaaagtgc ataattcaca gattgaaatt 
ttatgaattg atggtggcaa caaaacacaa 
aatgggcata ttcgcacaca ttccacacaa 
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cataataaat gtattttttt aaataatatt 33960 
cataattatc aaatacaaat attataattg 34020 
gtcgtgtgac agtcagaatc cttgtcacac 34080 
aagttgagat gagtgactgg tggacattgc 34140 
cgagtcatgt cgatgcagcg ctgaaactca 34200 
caaccagctg cccttcaccg cactgataat 34260 
attgctgcat ttgcatcaca atcaatgaat 34320 
aaagagttga tattatttct attgatataa 34380 
taggatttta gttataatcc tcgtcacatc 34440 
tggtagtgtg gcaccaaatt cgtgctgtaa 34500 
ttcctctcct tgcagcgaaa aataaaattt 34560 
aaaaaccaac aaaaacggcc ttgttccaaa 34620 
gccgagattt ctctttttct acgccaagta 34 680 
ttctaacctc taatctcttt ttcaacatga 34740 
ggcttataga tttggagcct ttgaaagtat 34 800 
gaataggcag tagttttttg gaccaattgt 34860 
aagaatttca taattcagat ttcaataaaa 34 920 
atattctttt gtttaagcag acaagtatgc 34 980 
cgggaggcgt acttgtacag aagagagata 35040 
tgaagttaaa aaataaagaa agagacatgc 35100 
gggaggtggg aggggggaca ggtgcatgta 35160 
aaatacgcat atgggaaatc ggaacagata 35220 
tggtttttcc ccgatttttg aattttttga 35280 
ggaaattggt cgagaaaaga ataaggagtg 35340 
attctacatt tgtaccaaaa tgcccactaa 35400 
attgcataca tattccacaa tggggaatat 354 60 
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tttgaatatt tagattaata aagatgaaat aattgagttt tatttgtaat taaaatattt 35520 
ttctgtttat cattaattga aaatgttgaa ttacttttta atagacgaat catcaaagaa 35580 
cttgatccct gcattatcag gcaatcctac ataacctttc aacgttgtcg ttttaccaat 35640 
tgcaacattt ctcgctactg gaacacgcat actggaatac gatgacgatt ccaattggaa 35700 
gaatatattg gtgcccggtt ggaagttaac aattgaattg ttgttaagcg ataaaggata 35760 
cacattgata acatccaaaa gttcagttat gtatatccat ccgtataaat cttgcgatct 35820 
tccattcacc aaaagctggt cgccatcttg tataggaatg aatggagtta aggatcccgt 35880 
aacagtacga gttgtgagcg tagttccact gaaaattact aaatatttag ttcaaaggtt 35940 
ttctgttact actttttggt tgcaacaact ctgagaaatt ttagttttca ccaaaatttt 36000 
tcgattttgt acagaattgc acaatatatt ttggaatagc aagaaattgt tcagtgaatg 36060 
tcaaatctga caaaaaaaaa tttttttaaa aggtgcctat caatttttaa aaatgttcta 36120 
atattttgtt ggaaagtttc aataatttca ctacatttac tatttctttt ttaggcctat 36180 
tttgggtatt caaaatatta accacacgac cttcaataca ggaaaactgt caaatttttt 36240 
ttaaattatg aacaattaac tcactttaca ttttgtcctc cattccttgt agttaatata 36300 
agacttccca acgcttcttg agaactattc gaaataatat aaatcttcga atttcttcct 36360 
actatatatc ctagtgtgtt gctcgttgca acgtctagag tatccaatat aaacccacta 36420 
gaagctgata taaagaaaaa taatagaaat atatttttca ttttttccaa atgactaaat 364 80 
gaccaacttc aagacatttt atatgcttaa aatcacgtca cagaactata atcatgttga 36540 
tttttgatag aaaatgataa gaaatgcgac caaaatgtgt attttctccg tttgtcctct 36600 
gaatgagtca aattcacgta aaacttggca tttgtcacag tgtgtcagac acaaggcaca 36660 
tatgtattta ccggactttt caagacttta ttattattga gatcaaacca gattacagaa 36720 
gacgggagaa aggtaccaac aaatatcaga atattgcaaa aaaaaattaa aaatttcaaa 36780 
acgcaaactt caaactagga gagctaattc aaactttgaa atcatgttcc ataaccggta 36840 
gcatttgttc ggtgacttgt ttgacagccc attgaaggaa gagaagtact cccgacaggc 36900 
tgaaacatat gaaatagtcc aggccttcca ttagagaatg tgatgtttga aggaagaaca 36960 
atgggacgta gagtactccg aatagagcag taagtccatt gatgagctga aacagtaaat 37020 
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aatcgaaaag ttagtaaata tgttcaagga atggaagtaa accggaatta tccgagtatg 37080 
ggcgttttat agttttttct ctttttttga cttcgttttt catcctatta aaatatcatc 37140 
ggttttttcg agttccagaa aaaatattta aaaaatcatc cgaaatccga acacaaaatc 37200 
cgaaggctac tccaaggtaa gttaacccta ctcggcaaat ctctcgtcct ggagcgcgga 37260 
cggggcgcga ctagatcacg ggttcgcgct ccagtcaccc tttttttcgc gcttcttacg 37320 
cgccacgtcc gcgcttcagg aggagcgatt tgcggagtac cttttatgca ttcagactgg 37380 
tacttaaaaa ttaatcgatt tttttaaaaa gtgtcataaa ctttttctac gtctttttct 37440 
gacacaatgt tgaaccgtac tagattgttg taaacacggt cttcaaattt gattttcgcg 37500 
aaaaaatttg aataattttt ttctaacttt tttcttttta aaatcttacc acacttagca 37560 
aataaccatg aagcacaact tcataagtgg atcctatttt tcgtttgaag aggcaaaata 37620 
ctgaaaacaa aagagctgat atggagcaag acacgtggat ccagaagagt atacgcacaa 37 680 
tcacactatc cccttcgatt ttgacgcggt acagaattct ggaatttttt tttgaacttt 37740 
aatggattgc gattcaaaag aaaacgtagc ttaatctcca gttaaagctg attttcattg 37800 
caaaatgtat ttagaaaaaa ctcacgctaa taaggcggag agtattgtct gtagaaccgc 37860 
catgattact gtagatgcat agagtgagaa tgagcacata taagcgctcg gctgtttttg 37 920 
aacgacaatc gaattggccg ccatcatctc attcttcgac ctcccgtttt atttetgaaa 37980 
atatatgaca ctttttaaat gaattgacag aaatctgatg ctaactacat tttaacttgt 3804O 
aggagtggtt caaatgattc ataaagggaa tacaatttct gaatgatcaa agaagaaaga 38100 
aaaaaaatat tggtgaatgt ataatttttt aggggtaaag taaataaata aacacaaggc 38160 
cgaagattag caagagtttg gggataaccc ccgtgaagaa aaatatgaaa aaaaatggtt 38220 
tgaaagaatt aaaaaaatcc tttcaaattt gagattcaaa ttttgttcat ctgttctgtt 38280 
cgaacattga gcagaagaag cttttaccaa taaatccaaa atttgttaag agaatatagt 38340 
ttaaggatat cacccagttc aaaatagtag ttcaaaaact cgagtcttaa ttttttcagt 38400 
attcgaattt ttacagtaca ttgatcgttt cgttatttga tcgctttttg ataaaacaaa 38460 
aaatagataa tgaagct'gcc aagtttaaaa aaatcggggc taaggctaat ggagcataca 38520 
cggtatatca ctacctggat attagtttta gacttcatca gatatttagt cagaaaagta 38580 



PCT/US99/01313 

WO 99/37660 

-71- 

cgtcaagaag tcggatacga aatgtataaa tttcttaaaa cttaaaactt cgagatatcc 38640 
agactgtggc tctcaagctt cagtgcttgg agaaatagtt taatagtcag aatatgtttt 38700 
aaatttctta atttttctga agaagtcgta aaagtataaa tgttgctaga tcaaacactc 38760 
tagaaaacct tcaccacttg agaatactcc agtctcaaat tttccctcga cgcggaagtg 38820 
tagaagggcg cgagattcag aagtaggtga aaattagacg gaaaactctc tcaaaattga 38880 
aatcaatgaa taggacaact gagacaatgt gcaggtgtat gtgtatgcac atggcaccca 38940 
cgtacacgca tacatcttat gttagagaag tacgtgtgct ccgctcatca tgtcttctcc 39000 
ttctcctaca tctacatttt ttgctccgtg agccacgccg ggaaaaacga cgacgacgac 39060 
ggcgacgggg gacgactact cgactctaat tggccctaaa cgcaagtaaa tttttaggca 39120 
atgtatgttt gcgagagttg agagccccac cgccacgagg agaagtgggg gaagattccg 39180 
aagagattcc ccctcctcct tctgatcacc tcgtctttcc ttttttgttc catttccgtg 39240 
aaaaagctgt ggaagggagg agaagaactt accggctaaa tggaaaaaaa ggaactctaa 39300 
cttattctga ctctacggaa ataggaagcc tacttgtcaa ttagaccgcc ctcgcacaga 39360 
tttctttttt tttgtagata caaatataaa aactaactgc gtgtgatgca gcagatatct 39420 
tgaattggaa agtgtcagtg ctcagaggga atagccaatc attgacagaa atttgactac 394 80 
ttcagaagga atcaactaga acatttgacg cctgaaacct aacaagaaaa atctataatt 39540 
tggagatccc tagattgatg ccaactttat taaaaactaa gtatacttat atatatacga 39600 
tttttttaaa aataaacctg attgtctgaa tttctacaag attgcgacca aattttccgt 39660 
atttccaaaa tctaatatta ggggtttcta ctaaaattca acgagaactc ttaacattat 39720 
ggttatttta acacatggtt caccgccggc tcaaacttca ttcttagtcc tctgattttt 39780 
ggtaaatcga cgcctacgtc tcaacaatta gtttgtgcag aaaataagta aaaagagttg 39840 
tgctccatct tgcacacata cacatcgcct gtaatgaaga ggttcggagt cagatgacta 39900 
ggcgtagaaa tgtgcgaaat tcacggataa cagagatttt tgatgtttca tcagacttac 39960 
acgttttgga agtatgaatt gggtctagac aacggagtgg cagatgttcg gaaaattttg 40020 
cagaaaagag aacctaagag cgttgatggt ttggtgacta acgaacttaa aagaaaattg 40080 
gtcattgaaa attttaaaat tttaaatttt gcttgcagtt catctttctc tattaacaaa 40140 
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aattattttg tagcttttct caatttcagg caattaaaac atttcaattt attcttctat 4 0200 
tatggaagtt tatctctaat tgaaactctc caattttgat caaagaacaa acgttctcgt 4 0260 
tgtttgaaaa aaaaaacagt tcttttttga aactcgcgcg caaattatta accaatcatc 40320 
ctcgtttgcg cgcaaaattg tagaaaaaat eatttaaatt tatcaaaaat agtttaccat 40380 
tctgatgagt ttttcatata caaaaatgcc ctggcaattg ttgttttctc tgaaatagca 40440 
cataataatt gaactctacc cacataaagt tcgttctgaa aaacacctta caattattgt 40500 
gattgagagc caccccaaga gggattagaa aaacggatgt aatctgtata ccttcgagat 40560 
tcgtttattt ccttgtataa ccaatagcag gaaaattaca gctttttcta agtaagcggt 40620 
gaaactagag agattctata gaatatgggc gttaataatt gtatgttaaa gttttagaat 40680 
aacacaagtc cagagtaagg gcaagaaaag taatgagcaa cggaaaccag catgcaagac 40740 
acccgaattc cggttctctt ctgaaactaa aagttgcgtg tactaaacct taaaccagca 40800 
gctggctagt ctcaagaaat aatagaaaaa aggaaggaat gaagatatgg gaataataca 40860 
aattgaaaat gttgtgtgag ctccgaataa ttttcaatat caaaaattta tgaattgtgt 40920 
ggacggctgt gtgtgcgtgt gcgtatgcgt cggcaagaaa aagaagcgac cgaataagaa 40980 
aatggttgat tcagtgaaca aaaaaagaga gaaagatatc caaacaaaat tattcaaaac 41040 
tattatcaat cggtaggtat tgctctagag cacacctttc tggacactca gcagacatgc 41100 
gtagagaggg attatgtggt acatatagtg gatggaggaa cagatattta taaatactta 41160 
tggaaaagag gatgaagata ggatgaggta gatgaattga gaagatttta aaatgataat 41220 
ggatattgaa tttgaataag gagattctaa attatccgaa gaacacaaac tatatcaaga 41280 
ctacaaaata atctagacga gtcccagttt tgcaaggtaa ggattaatct taaaaggatc 4134 0 
ttttaaatat ttatttcaat gctcctataa attttaaaaa gtaggtgcat tctaatatgt 41400 
acagtgatta ggagatatgt gacgttacgt gaggtctcga taaagtacgg tattcgagct 414 60 
aaatttcaaa cattgtcaag gtagattcgg tacacagcca ccataaatgt tccactaaaa 41520 
atgtgttgtc cttctccttt ggaacacaaa tctagctgct gaactttttc acttcactac 41580 
atgtcaatgg gattgatatg catctaggac atttttttgg ttatcaatag tccgcatagc 4164 0 
ttgcgtaacc aatacaaccg attgtccaaa aaaatttgaa cactacaaaa cgtatttatt 417 00 
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attcggatac ccgttgcatt tcaatacaca agttgatact tgctgcccct cggggctctc 41760 
agacactcat tgactgaaaa cagacgattg ctcgtcgtcg tagtctgaag gctcggagag 41820 
ctgaggaaga tatgaggaca taatgaattg atgtgtgaga atgagaaaat gaaaaaggaa 41880 
aaatgagaaa aaaaagatga tgaagaatgt acaaatgaat aatcaagtag caatgacgag 41940 
aaaagaacca ggtccttttg gcaggcaatt ttcgaaattt tcagatcaaa tttgtcgcca 42000 
ttgcttctgg attaataatg gatgacgctt tgacaatggt gctcaataca agtgcaaaca 42060 
gattggtttg ggatggcgta tagaaataga gccggtgaga cgatgtgatg aagttctgag 42120 
agacgagatg tgatcgaggc gtttgtagtc gaggcaaacc gaggccgcat atggggttcc 42180 
gataggcaat cggagaccag tgtccatctg aaagagataa aagttattcg agttgtgaat 42240 
gttgcaagga aaattaaagg tacagtagag acaatcgaga cttttttcgg gaggacgcca 42300 
tctaaaaact gtggaagcac gtggctttgg tagcttgatg tcacagaagt tgattccata 4 2 360 
agaattacat tagaaagctt gcgacgctaa atggataaat ctggtaacgg cttcctaata 4 2420 
gcaagttaag ttttttcaca ataaattttt cagaattgaa tagatgcatt ttataactta 42480 
cacatcgagt gggcacgttg gtggacaaga caagccccga t 42521 

<210> 24 
<211> 4434 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 24 

gccgccctcg ccaccgctcc cggccgccgc gctccggtac acacaggatc cctgctgggc 60 
accaacagct ccaccatggg gctggcctgg ggactaggcg tcctgttcct gatgcatgtg 120 
tgtggcacca accgcattcc agagtctggc ggagacaaca gcgtgtttga catctttgaa 180 
ctcaccgggg ccgcccgcaa ggggtctggg cgccgactgg tgaagggccc cgacccttcc 24 0 
agcccagctt tccgcatcga ggatgccaac ctgatccccc ctgtgcctga tgacaagttc 300 
caagacctgg tggatgctgt gcggacagaa aagggtttcc tccttctggc atccctgagg 360 
cagatgaaga agacccgggg cacgctgctg gccctggagc ggaaagacca ctctggccag 4 20 
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gtcttcagcg tggtgtccaa tggcaaggcg ggcaccctgg acctcagcct gaccgtccaa 4 80 
ggaaagcagc acgtggtgtc tgtggaagaa gctctcctgg caaccggcca gtggaagagc 54 0 
atcaccctgt ttgtgcagga agacagggcc cagctgtaca tcgactgtga aaagatggag 600 
aatgctgagt tggacgtccc catccaaagc gtcttcacca gagacctggc cagcatcgcc 660 
agactccgca tcgcaaaggg gggcgtcaat gacaatttcc agggggtgct gcagaatgtg 720 
aggtttgtct ttggaaccac accagaagac atcctcagga acaaaggctg ctccagctct 7 80 
accagtgtcc tcctcaccct tgacaacaac gtggtgaatg gttccagccc tgccatccgc 840 
actaactaca ttggccacaa gacaaaggac ttgcaagcca tctgcggcat ctcctgtgat 900 
gagctgtcca gcatggtcct ggaactcagg ggcctgcgca ccattgtgac cacgctgcag 960 
gacagcatcc gcaaagtgac tgaagagaac aaagagttgg ccaatgagct gaggcggcct 1020 
cccctatgct atcacaacgg agttcagtac agaaataacg aggaatggac tgttgatagc 1080 
tgcactgagt gtcactgtca gaactcagtt accatctgca aaaaggtgtc ctgccccatc 1140 
atgccctgct ccaatgccac agttcctgat ggagaatgct gtcctcgctg ttggcccagc 1200 
gactctgcgg acgatggctg gtctccatgg tccgagtgga cctcctgttc tacgagctgt 1260 
ggcaatggaa ttcagcagcg cggccgctcc tgcgatagcc tcaacaaccg atgtgagggc 1320 
tcctcggtcc agacacggac ctgccacatt caggagtgtg acaagagatt taaacaggat 1380 
ggtggctgga gccactggtc cccgtggtca tcttgttctg tgacatgtgg tgatggtgtg 1440 
atcacaagga tccggctctg caactctccc agcccccaga tgaacgggaa accctgtgaa 1500 
ggcgaagcgc gggagaccaa agcctgcaag aaagacgcct gccccatcaa tggaggctgg 1560 
ggtccttggt caccatggga catctgttct gtcacctgtg gaggaggggt acagaaacgt 1620 
agtcgtctct gcaacaaccc cacaccccag tttggaggca aggactgcgt tggtgatgta 1680 
acagaaaacc agatctgcaa caagcaggac tgtccaattg atggatgcct gtccaatccc 1740 
tgctttgccg gcgtgaagtg tactagctac cctgatggca gctggaaatg tggtgcttgt 1800 
ccccctggtt acagtggaaa tggcatccag tgcacagatg ttgatgagtg caaagaagtg I8 60 
cctgatgcct gcttcaacca caatggagag caccggtgtg agaacacgga ccccggctac 1920 
aactgcctgc cctgcccccc acgcttcacc ggctcacagc ccttcggcca gggtgtcgaa 1980 
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catgccacgg ccaacaaaca ggtgtgcaag ccccgtaacc cctgcacgga tgggacccac 2040 

gactgcaaca agaacgccaa gtgcaactac ctgggccact atagcgaccc catgtaccgc 2100 

tgcgagtgca agcctggcta cgctggcaat ggcatcatct gcggggagga cacagacctg 2160 

gatggctggc ccaatgagaa cctggtgtgc gtggccaatg cgacttacca ctgcaaaaag 222 0. 

gataattgcc ccaaccttcc caactcaggg caggaagact atgacaagga tggaattggt 2280 

gatgcctgtg atgatgacga tgacaatgat aaaattccag atgacaggga caactgtcca 2340 

ttccattaca acccagctca gtatgactat gacagagatg atgtgggaga ccgctgtgac 2400 

aactgtccct acaaccacaa cccagatcag gcagacacag acaacaatgg ggaaggagac 24 60 

gcctgtgctg cagacattga tggagacggt atcctcaatg aacgggacaa ctgccagtac 2520 

gtctacaatg tggaccagag agacactgat atggatgggg ttggagatca gtgtgacaat 2580 

tgccccttgg aacacaatcc .ggatcagctg gactctgact cagaccgcat tggagatacc -2640 

tgtgacaaca atcaggatat tgatgaagat ggccaccaga acaatctgga caactgtccc 2700 

tatgtgccca atgccaacca ggctgaccat gacaaagatg gcaagggaga tgcctgtgac 27 60 

cacgatgatg acaacgatgg cattcctgat gacaaggaca actgcagact cgtgcccaat 2820 

cccgaccaga aggactctga cggcgatggt cgaggtgatg cctgcaaaga tgattttgac 2880 

catgacagtg tgccagacat cgatgacatc tgtcctgaga atgttgacat cagtgagacc 2940 

gatttccgcc gattccagat gattcctctg gaccccaaag ggacatccca aaatgaccct 3000 

aactgggttg tacgccatca gggtaaagaa ctcgtccaga ctgtcaactg tgatcctgga 3060 

ctcgctgtag gttatgatga gtttaatgct gtggacttca gtggcacctt cttcatcaac 3120 

accgaaaggg acgatgacta tgctggattt gtctttggct accagtccag cagccgcttt 3180 

tatgttgtga tgtggaagca agtcacccag tcctactggg acaccaaccc cacgagggct 324 0 

cagggatact cgggcctttc tgtgaaagtt gtaaactcca ccacagggcc tggcgagcac 3300 

ctgcggaacg ccctgtggca cacaggaaac acccctggcc aggtgcgcac cctgtggcat 3360 

gaccctcgtc acataggctg gaaagatttc accgcctaca gatggcgtct cagccacagg 3420 

ccaaagacgg gtttcattag agtggtgatg tatgaaggga agaaaatcat ggctgactca 3480 

ggacccatct atgataaaac ctatgctggt ggtagactag ggttgtttgt cttctctcaa 3540 
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gaaatggtgt tcttctctga cctgaaatac gaatgtagag atccctaatc atcaaattgt 3600 

tgattgaaag actgatcata aaccaatgct ggtattgcac cttctggaac tatgggcttg 3660 

agaaaacccc caggatcact tctccttggc ttccttcttt tctgtgcttg catcagtgtg 3720 

gactcctaga acgtgcgacc tgcctcaaga aaatgcagtt ttcaaaaaca gactcagcat 3780 

tcagcctcca atgaataaga catcttccaa gcatataaac aattgctttg gtttcctttt 3840 

gaaaaagcat ctacttgctt cagttgggaa ggtgcccatt ccactctgcc tttgtcacag 3900 

agcagggtgc tattgtgagg ccatctctga gcagtggact caaaagcatt ttcaggcatg 3960 

tcagagaagg gaggactcac tagaattagc aaacaaaacc accctgacat cctccttcag 4 020 

gaacacgggg agcagaggcc aaagcactaa ggggagggcg catacccgag acgattgtat 4080 

gaagaaaata tggaggaact gttacatgtt cggtactaag tcattttcag gggattgaaa 414 0 

gactattgct ggatttcatg atgctgactg gcgttagctg attaacccat gtaaataggc 4200 

acttaaatag aagcaggaaa gggagacaaa gactggcttc tggacttcct ccctgatccc 4260 

cacccttact catcacctgc agtggccaga attagggaat cagaatcgaa accagtgtaa 4320 

ggcagtgctg gctgccattg cctggtcaca ttgaaattgg tggcttcatt ctagatgtag 4380 

cttgtgcaga tgtagcagga aaataggaaa acctaccatc tcagtgagca ccag 4 4 34 

<210> 25 
<211> 2837 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 25 

agagagccag tccgatgtct gcagcctccc tggccaggcc tctcctctcc tgccgcagct 60 

agtccccctc aggacagaca gagtactggc gtcggtcacc attcacttgc aaacacacca 120 

ggtcacgtga agaaacttcc tggtgacact caggctgtag ctgtgcactc ttcaaccacg 180 

aggttggttt tctcctaagt gtcacaggtg gagacaagat gctctgggca ctggccctgc 24 0 

tggctctggg catagggcca agagcttctg ctggtgacca cgtcaaggac acttcatttg 300 

accttttcag catcagcaac attaaccgga agaccatcgg tgccaagcag ttccgagggc 360 
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ctgaccccgg ggtgcccgcc taccgttttg tacggtttga ctacatcccc ccagtgaaca 4 20 
cagatgatct caacaggatt gtcaagcttg caaggagaaa ggagggcttc ttcctcacag 480 
cccaactgaa gcaggaccgc aagtctcggg gaacgctcct ggtgttggaa ggccccggca 54 0 
cctcccagag gcagtttgag attgtgtcca atggcccagg ggacactttg gacctcaact 600 
actgggtaga aggcaatcag cataccaact tcctggagga tgtgggcctg gctgactccc 660 
agtggaagaa tgtgactgtg caggtggcca gtgacaccta tagcctgtat gtgggctgcg 720 
atcttatcga cagtgtcacc ctggaagaac cattctatga gcagctagaa gtagacagga 780 
gcaggatgta cgtggccaaa ggtgcatctc gagagagtca cttcaggggc ttgctgcaga 84 0 
atgtccatct cgtgtttgca gattctgtgg aagatatctt aagcaagaaa agctgtcaac 900 
acagccaggg agctgaagtc aacaccatca gtgaacatac agagactctc catctgagcc 960 
ctcacatcac cacagatctc gtggtccagg gtgtggagaa ggcacaggag gtgtgtacgc 1020 
actcctgcga ggagttgagc aacatgatga atgagctctc tggactgcac gtcatggtga 1080 
accagctgag caagaacctg gagagagtgt ctagtgataa ccagttcctt ttggagctca 114 0 
ttgggggccc tctgaagaca agaaacatgt cagcctgtgt gcaggagggc cgaatctttg 1200 
cagaaaatga aacctgggtt gtggatagtt gtaccacatg cacctgcaag aaatttaaaa 1260 
cagtctgcca tcagatcacc tgctcacctg caacttgtgc caacccatct tttgtggaag 1320 
gcgagtgctg tccatcctgt tcacactctg cagacagtga tgagggctgg tctccgtggg 1380 
cagagtggac cgagtgttct gtcacctgtg gctctgggac ccagcagaga ggccggtctt 14 4 0 
gtgatgtcac cagcaacacc tgcctgggcc cctccattca gacaaggaca tgcagcctgg 1500 
gcaaatgtga tacgagaatc cgtcagaatg gaggctggag tcactggtca ccctggtctt 15 60 
catgctccgt gacttgtgga gttggcaatg ' tcacccgcat acgtctctgc aactcaccag 1620 
tgccccagat gggtggcaag aactgcaagg gcagcggccg ggaaaccaaa ccctgtcagc 1680 
gtgatccgtg cccaattgat ggccgctgga gcccctggtc cccttggtca gcctgcacag 17 4 0 
ttacctgtgc tggagggatc cgtgagcgct cacgtgtttg caacagccct gagccccagt 1800 
atggagggaa ggactgtgtc ggggatgtga cagaacacca aatgtgcaac aagagaagct 18 60 
gccctattga tgggtgctta tccaacccgt gttttcctgg agccaagtgc aacagcttcc 1920 



WO 99/37660 PCT/US99/013I3 

-78- 



ctgatgggtc 


ctggtcctgt 


ggctcctgcc 


cagtgggctt 


tctgggcaat 


ggtacccact 


1980 


gtgaggacct 


ggatgagtgt 


gctgtggtca 


cagatatttg 


cttctcaact 


aacaaagctc 


2040 


cccgctgtgt 


caacaccaac 


ccgggcttcc 


actgcctgcc 


ttgtccacca cgctacaagg 


2100 


ggaaccaacc 


cttcggtgtt 


ggcctggagg 


atgctaggac 


agaaaaacaa 


gtgtgtgagc 


2160 


cagagaatcc 


atgtaaggac 


aagactcaca 


gctgccacaa 


gaat gcagag 


tgcatctacc 


2220 


tgggccactt 


tagtgacccc 


atgtacaagt 


gtgagtgcca 


gattggctac 


gcaggtgatg 


2280 


ggctcatctg 


cggggaggac 


tcagacctgg 


atggctggcc 


caacaacaac 


ctggtgtgtg 


2340 


ctactaatgc 


cacctaccac 


tgcatcaagg 


acaactgccc 


caaactgcca 


aattccgggc 


2400 


aggaggattt 


tgataaggat 


ggaatcggag 


atgcttgtga 


cgaggacgat 


gacaatgacg 


2460 


gtgtgagcga 


tgagaaggac 


aattgccagc 


ttctcttcaa 


tccccgtcaa 


ttagactatg 


2520 


acaaggatga 


ggttggagac 


cgctgtgaca 


actgccccta 


tgtgcacaac 


ccagcacaga 


2580 


ucgacacaga 




y a yyyyy a, -y 


cctgct ctgt 


ggacattgac 


ggagacgatg 


264 0 


ttttcaatga 


gcgagacaat 


tgtccatatg 


tctacaacac 


tgaccagaga 


gatacggatg 


2700 


gtgatggcgt 


gggtgaccac 


tgtgacaatt 


gtcctctgat 


gcacaaccca 


gatcagatcg 


2760 


atcaggacaa 


tgatctcgtt 


ggagaccagt 


gtgacaacaa 


tgaggacata 


gatgatgacg 


2820 


gccaccagaa 


caaccaa 










2837 



<210> 26 
<211> 4108 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 26 

agagagccag tccgatgtct gcagcctccc tggccaggcc tctcctctcc tgccgcagct 60 

agtccccctc aggacagaca gagtactggc gtcggtcacc attcacttgc aaacacacca 120 

ggtcacgtga agaaacttcc tggtgacact caggctgtag ctgtgcactc ttcaaccacg 180 

aggttggttt tctcctaagt gtcacaggtg gagacaagat gctctgggca ctggccctgc 240 

tggctctggg catagggcca agagcttctg ctggtgacca cgtcaaggac acttcatttg 300 
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accttttcag catcagcaac attaaccgga agaccatcgg tgccaagcag ttccgagggc 360 
ctgaccccgg ggtgcccgcc taccgttttg tacggtttga ctacatcccc ccagtgaaca 420 
cagatgatct caacaggatt gtcaagcttg caaggagaaa ggagggcttc ttcctcacag 480 
cccaactgaa gcaggaccgc aagtctcggg gaacgctcct ggtgttggaa ggccccggca 54 0 
cctcccagag gcagtttgag attgtgtcca atggcccagg ggacactttg gacctcaact 600 
actgggtaga aggcaatcag cataccaact tcctggagga tgtgggcctg gctgactccc 660 
agtggaagaa tgtgactgtg caggtggcca gtgacaccta tagcctgtat gtgggctgcg 720 
atcttatcga cagtgtcacc ctggaagaac cattctatga gcagctagaa gtagacagga 780 
gcaggatgta cgtggccaaa ggtgcatctc gagagagtca cttcaggggc ttgctgcaga 840 
atgtccatct cgtgtttgca gattctgtgg aagatatctt aagcaagaaa agctgtcaac 900 
acagccaggg agctgaagtc aacaccatca gtgaacatac agagactctc catctgagcc 960 . . 
ctcacatcac cacagatctc gtggtccagg gtgtggagaa ggcacaggag gtgtgtacgc 1020 
actcctgcga ggagttgagc aacatgatga atgagctctc tggactgcac gtcatggtga 1080 
accagctgag caagaacctg gagagagtgt ctagtgataa ccagttcctt ttggagctca 1140 
ttgggggccc tctgaagaca agaaacatgt cagcctgtgt gcaggagggc cgaatctttg 1200 
cagaaaatga aacctgggtt gtggatagtt gtaccacatg cacctgcaag aaatttaaaa 1260 
cagtctgcca tcagatcacc tgctcacctg caacttgtgc caacccatct tttgtggaag 1320 
gcgagtgctg tccatcctgt tcacactctg cagacagtga tgagggctgg tctccgtggg 1380 
cagagtggac cgagtgttct gtcacctgtg gctctgggac ccagcagaga ggccggtctt 14 40 
gtgatgtcac cagcaacacc tgcctgggcc cctccattca gacaaggaca tgcagcctgg 1500 
gcaaatgtga tacgagaatc cgtcagaatg gaggctggag tcactggtca ccctggtctt 1560 
catgctccgt gacttgtgga gttggcaatg tcacccgcat acgtctctgc aactcaccag 1620 
tgccccagat gggtggcaag aactgcaagg gcagcggccg ggaaaccaaa* ccctgtcagc 1680 
gtgatccgtg cccaattgat ggccgctgga gcccctggtc cccttggtca gcctgcacag 1740 
ttacctgtgc tggagggatc cgtgagcgct cacgtgtttg caacagccct gagccccagt 1800 
atggagggaa ggactgtgtc ggggatgtga cagaacacca aatgtgcaac aagagaagct 18 60 
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gccctattga tgggtgctta tccaacccgt gttttcctgg agccaagtgc aacagcttcc 1920 

ctgatgggtc ctggtcctgt ggctcctgcc cagtgggctt tctgggcaat ggtacccact 1980 

gtgaggacct ggatgagtgt gctgtggtca cagatatttg cttctcaact aacaaagctc 204 0 

cccgctgtgt caacaccaac ccgggcttcc actgcctgcc ttgtccacca cgctacaagg 2100 

ggaaccaacc cttcggtgtt ggcctggagg atgctaggac agaaaaacaa gtgtgtgagc 2160 

cagagaatcc atgtaaggac aagactcaca gctgccacaa gaatgcagag tgcatctacc 2220 

tgggccactt tagtgacccc atgtacaagt gtgagtgcca gattggctac gcaggtgatg 2280 

ggctcatctg cggggaggac tcagacctgg atggctggcc caacaacaac ctggtgtgtg 2340 

ctactaatgc cacctaccac tgcatcaagg acaactgccc caaactgcca aattccgggc 24 00 

aggaggattt tgataaggat ggaatcggag atgcttgtga cgaggacgat gacaatgacg 24 60 

gtgtgagcga tgagaaggac aattgccagc ttctcttcaa tccccgtcaa ttagactatg 2520 

acaaggatga ggttggagac cgctgtgaca actgccccta tgtgcacaac ccagcacaga 2580 

tcgacacaga caacaatggc gagggggatg cctgctctgt ggacattgac ggagacgatg 264 0 

ttttcaatga gcgagacaat tgtccatatg tctacaacac tgaccagaga gatacggatg 2700 

gtgatggcgt gggtgaccac tgtgacaatt gtcctctgat gcacaaccca gatcagatcg 27 60 

atcaggacaa tgatctcgtt ggagaccagt gtgacaacaa tgaggacata gatgatgacg 2820 

gccaccagaa caaccaagac aactgcccat acatctccaa ctccaaccag gctgaccatg 2880 

acaacgacgg caagggcgat gcctgcgact ctgatgatga caatgatggt gttccagatg 2 94 0 

acagggacaa ctgtcggctt gtgttcaacc cagaccagga agactcggac ggtgacggcc 3000 

gaggtgacat ttgtaaagat gactttgaca atgataatgt cccagatatt gatgatgtgt 3060 

gccctgagaa caatgccatc actgagacag acttcagaaa cttccagatg gtccctctgg 3120 

atcccaaggg gaccacacaa attgatccca actgggtaat tcgtcaccaa ggcaaagagc 3180 

tggtgcagac agcaaactca gaccctggca tcgctgtagg tttcgacgag tttgggtctg 324 0 

tggacttcag tggcactttc tatgtcaaca ctgaccggga tgatgactac gctggctttg 3300 

tctttggcta tcagtcaagc agccgcttct atgtggtgat gtggaagcag gtgacccaga 3360 

cctactggga agacaargccc agtcgggctt acggctactc tggtgtgtca ctcaaagtgg 34 20 
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taaactccac 


gactggtact 


ggcgagca cc 


tgaggaatgc 


cct gtggcac 


aegggaaaca 


34 8 0 


cagaaggcca 


ggtccggact 


ctatggcatg 


accccaaaaa 


^ ^ ^ /— » 

CoutgyCtgy 


aaagactaca 


354 0 


ctgcctacag 


gtggcacctg 


attcacaggc 


ctaagacagg 


^ ^ ^ S 4* /T — ^ i—T ^ 

CloCaL^aya 


gtctt agtgc 


3600 


atgaaggaaa 


gcaagtcatg 


gctgactcag 


gaccaattt a 


tgaccaaacc 


tacgctggtg 


3660 


gacggctggg 


cctgtttgtc 


ttctcccaag 


agatggt eta 


t tt ct eggae 


ctcaagt atg 


3720 


agtgcagaga 


tgcctagaga 


gcagggctcc 


agctccagca 


at gt get gca 


aacacccctt 


*3 1 O f\ 

37 80 


cttagacaca 


tcagtccatc 


ttggcacttg 


tggcttttct 


gtca tt tggc 


atttcctgtt 


384 0 


tcttgacctt 


aactgagtgg 


atctacacct 


ccttcatcag 


caccaagtcc 


aagtgtcttc 


3900 


aaaggagaaa 


catcaattgc 


actccaagag 


cttccagcct 


gctgctggaa 


aacatttgga 


3960 


tgagatatga 


ggctcaccgt 


ggagcgaaga 


ccgagcattc 


cgctgtgttg 


ccttttcttg 


4020 


tttgtttaaa 


aagaatgacg 


tttacatgta 


aatgtaatta 


ettgeagtat 


ttatgtgtat- 


4080 


atggagtcga 


agggagcttt 


agagcaca 








4108 



<210> 27 
<211> 820 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 27 

tcgaccagag gaggggaggc cagttcctct cccaagggtg ccacacaccc ctccctgttc 60 

atcaccagac aggcccttcc ttcttageca tatgetaace ttctcctccc tgggaaattt 120 

cctctgcagg agecaaagea gatgggagct ggagttgctg gagctcctgg tetgtatgea 180 

gagcaggcat ccaggaaagg agaagagagt gtgacaatcc agcacctcag aatggagggg 24 0 

cctcgtgttc agggeggaaa gtacagaege aggcttgetg agggectctg gacacaggct 300 

ggaccagatg ctgtggatgt cgacccctgc actgactatt ggataaagac ttctttcaac 360 

taagagaaga tgcaaatcag cacacttttt tctttgttct gccagcttcc aggectaaga 420 

ctaggttttg ctgtctacag ccaactattc tattagttac aaaactcaat cattttattc 480 

agcaactgga tgttgactgt taactagaag ctctgtccta cttacagcac tttggatcat 540 
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caaaaaaata aagtaaaata gaaaactgag aaaactcaat ccatgaccag ggagaactta 600 

caggatgtta gagacaaaac aagcagacac ctgaaacaat caacgcccaa taaaacaaag 660 

taggatgaaa attctcttag ttctttgata acaatttgtt cactcataga aacattatta 720 

attggtaggg taagcagaca ctctgaaaca atgagaaaaa tactaaaaat tgacttgagt 780 

tatttcaaat tgcctcattg acctgttata tcataactct 820 

<210> 28 
<211> 2397 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 28 



tttttttttt 


catcctactt 


tgttttattg 


ggcgttgatt 


gttacaggtc 


ccagcctgta 


60 


gacatctttt 


actccaattt 


cctgaataga 


tagctttatt 


cctt caaggt 


aatatagtgc 


120 


ggtggcttct 


ggctgagatg 


tttgctgttg 


ttttcttcat 


cttgtctttg 


atgacttgtc 


180 


agcctggggt 


aactgcacag 


gagaaggtga 


accagagagt 


aagacgggca 


gctacacccg 


240 


cagcagttac 


ctgccagctg 


agcaactggt 


cagagtggac 


agattgcttt 


ccgtgccagg 


300 


acaaaaagta 


ccgacaccgg 


agcctcttgc 


agccaaacaa 


gtttggggga 


accatctgca 


360 


gtggtgacat 


ctgggatcaa 


gccagctgct 


ccagttctac 


aacttgtgta 


aggcaagcac 


420 


agtgtggaca 


ggatttccag 


tgtaaggaga 


caggtcgctg 


cctgaaacgc 


caccttgtgt 


480 


gtaatggaga 


ccaggactgc 


cttgatggct 


ctgatgagga 


cgactgtgaa 


gatgtcaggg 


540 


ccattgacga 


agactgcagc 


cagtatgaac 


caattccagg 


atcacagaag 


gcagccttgg 


600 


ggtacaatat 


cctgacccag 


gaagatgctc 


agagtgtgta 


cgatgccagt 


tattatgggg 


660 


gccagtgtga 


gacggtatac 


aatggggaat 


ggagggagct 


tcgatatgac 


tccacctgtg 


720 


aacgtctcta 


ctatggagat 


gatgagaaat 


actttcggaa 


accctacaac 


tttctgaagt 


780 


accactttga 


agccctggca 


gatactggaa 


tctcctcaga 


gttttatgat 


aatgcaaatg 


840 


accttctttc 


caaagttaaa 


aaagacaagt 


ctgactcatt 


tggagtgacc 


atcggcatag 


900 


gcccagccgg 


cagcccttta 


ttggtgggtg 


taggtgtatc 


ccactcacaa 


gacacttcat 


960 
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tcttgaacga attaaacaag tataatgaga agaaattcat tttcacaaga atcttcacaa 1020 

aggtgcagac tgcacatttt aagatgagga aggatgacat tatgctggat gaaggaatgc 1080 

tgcagtcatt aatggagctt ccagatcagt acaattatgg catgtatgcc aagttcatca 1140 

atgactatgg cacccattac atcacatctg gatccatggg tggcatttat gaatatatcc 1200 

tggtgattga caaagcaaaa atggaatccc ttggtattac cagcagagat atcacgacat 1260 

gttttggagg ctccttgggc attcaatatg aagacaaaat aaatgttggt ggaggtttat 1320 

caggagacca ttgtaaaaaa tttggaggtg gcaaaactga aagggccagg aaggccatgg 1380 

ctgtggaaga cattatttct cgggtgcgag gtggcagttc tggctggagc ggtggcttgg 14 4 0 

cacagaacag • gagcaccatt acataccgtt cctgggggag gtcattaaag tataatcctg 1500 

ttgttatcga ttttgagatg cagcctatcc acgaggtgct gcggcacaca agcctggggc 1560 

ctctggaggc caagcgccag aacctgcgcc gcgccttgga ccagtatctg atggaattca, 1620 

atgcctgccg atgtgggcct tgcttcaaca atggggtgcc catcctcgag ggcaccagct 1680 

gcaggtgcca gtgccgcctg ggtagcttgg gtgctgcctg tgagcaaaca cagacagaag 17 4 0 

gagccaaagc agatgggagc tggagttgct ggagctcctg gtctgtatgc agagcaggca 1800 

tccaggaaag gagaagagag tgtgacaatc cagcacctca gaatggaggg gcctcgtgtc 18 60 

cagggcggaa agtacagacg caggcttgct gagggcctct ggacacaggc tggaccagat 1920 

gctgtggatg tcgacccctg cactgactat tggataaaga cttctttcaa ctaagagaag 1980 

atgcaaatca gcacactttt ttctttgttc tgccagcttc caggcctaag actaggtttt 2040 

gctgtctaca gccaactatt ctattagtta caaaactcaa tcattttatt cagcaactgg 2100 

atgttgactg ttaactagaa gctctgtcct acttacagca ctttggatca tcaaaaaaat 2160 

aaagtaaaat agaaaactga gaaaactcaa tccatgacca gggagaactt acaggatgtt 2220 

agagacaaaa caagcagaca cctgaaacaa tcaacgccca ataaaacaaa gtaggatgaa 2280 

aattctctta gttctttgat aacaatttgt tcactcatag aaacattatt aattggtagg 2340 

gtaagcagac actctgaaac aatgagaaaa atactaaaaa ttgacttgag ttatttc 2397 



<210> 29 
<211> 4100 
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<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 29 

ggatcccccc gctccgctac catcttcatc gacctcaccc aggacgacga ctgagctccc 60 
tcttcctcgc cgcggactgg ggcgaccctg ttgctgctgc ggccgccgcc gctcctgccc 120 
ccacttcggc tcccgctcct gctcctgctc ecggccccac tcctgttcct gttcctgttc 180 
ctgttcctgt tcccggtcct gctccggctc ccggccccgc acccacctcc gctcctgctg 24 0 
cgggtctcca ggcccagaca aaataaaaaa agatatattt tttcagtccg tctctcccgc 300 
ccggtgtctt ctatggctga gggagtctgg ctctcggggc tctcgggtcg gctgggcggc 360 
tcggctggtt ggctggctgg cgagatggac cgctccggcg cgcagcgtcc gcggctgctg 420 
tgatgggtgg gcggagcgcg gaccggggat tatatacacg atgtgcatcc ataattgatg 4 80 
ttgtttgaga aaaacaaagt cataaagtgg cactcagaca gcactttggc ctggcgcccg 540 
gccaccatct gagtgcccaa ccgggcccgg cggttacatc acccccacat ggaccatcac 600 
ggcccattag caccaattgg ccagagtgtc gggagccacc gctaattgca gtaacgcgcg 660 
gctgccagac tgcaatttac cgcgcgatac tgcagtttac tgcagccgcg gtaaactgca 720 
gtacgcggcg gccgcaggaa atctactgta gtatttggcg gcggcgcgcg gtactgcaac 7 80 
tgtagtaaac tgtagctgca gtagagttac tgcagcgcca tcgggccggt gtggccgcca 840 
gggtaactgc acccgcagta aatttactgc agccggactt tgtgcgctgt ggagaccgcg 900 
ccgaactggg acccccccga ctcccccccg actccccccc gactcccccc cgactccccc 960 
ccgactcccc cccgactccc ccccgactcc cccccgactc ccccgggacg cgtccgcgcc 102 0 
tcgatgcgcc ccatcgcgcc ccgttccgct* tcgccacgct ccagttgccc cgcccccggc 1080 
acgtggcacg tatttccccc ccgtaaatca agagggatta tgcggatgtc tagtttatgt 114 0 
ctcaatttcc tctttccgga gataaaagcc gggacccccg cgccgaaaaa ggatacacca 1200 x 
gccgcgatgt cgccgctcgt ggcggtgctg gtgttttttt cggcggcgct gggggttcct 1260 
ggccccggcg tcgcgggaaa cccccgtggg ctcgatgcca tcttcgaggc cccggtcacg 1320 
cccgcgcccc ccactcgcca tcctcggcgc gaggagctgg agtgggacga tgaggatcac 138 0 
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ccgctgctgg )acctcgagcc gcccgtggga tcacgctgcc atccctacat cgcgtactcg 14 4 0 
ctgccgccgg acatgaacgc cgtcacgagc gtggtcgtga agccctactg ctcgccgccg 1500 
gaggtcatcc tgtgggcgtc tggcaccgcc tacctggtca acccctttgt cgccatccag 1560 
gccctggccg tcggagagcc cttaaatgag gcggccctca aggagctcgg agaggtggcc 1620 
gtgcacaagg actccctgcc gccgctgcgc tataatggag ggccccccgc cgagtaagag 1680 
accctgcggc ctgccgcccg gggtgcgcct cgtcgtgcct gccgccgccg ccgcttctgc 17 4 0 
ctctaacgcc. gccaccgccg ctgcagcagc agcccccgcc ggggccgggg ccggggcctc 1800 
gaagccggcc cgaccccccg ccgccgcccg gcccgcgaag ggcacgcccg cggcgtcggc 18 60 
ggcaacaaca gccacggggg ccgacgcctc cgccccggcc cccgaccccg gggcgcccac 1920 
gtgggacgcc ttcgccgccg agttcgacgt ggccccctcg tggcgcgcgc tgctggagcc 1980 
cgagatcgcc aagccgtacg cgcgcctgct gctggccgag taccgcggcc gctgcctgac 204 0. 
cgaggaggtg ctgcccgcgc gcgaggacgt gttcgcctgg acgcgcctca cggcgcccga 2100 
ggacgtcaag gtggtcatca tcggccagga cccgtaccac gggccgggcc aggcccacgg 2160 
gctggccttc agcgtccggc gcggggtgcc gatccccccg agcctggcca acatcttcgc 2220 
ggcggtccgg gcgacgtacc cgacgctgcc cgcgcccgcc cacggctgcc tggaggcctg 2280 
ggcgcgccgc ggggtgctgc tgctgaacac gacgctgacc gtgcggcgcg gggtccccgg 234 0 
ctcccacgcc ccgctcggct gggcgcggct cgtgcgcgcc gtcgtccagc ggctctgcga 24 00 
gacccgcccc aagctggtgt tcatgctctg gggcgcccac gctcaaaagg cctgcgcgcc 24 60 
ggacccgcgc cgccacaagg tgctcacctt cagccatccg tcgccgctgg cccgcacgcc 2520 
cttcaggacc tgcccgcact ttggagaggc gaacgcgtac ctcgtccaga cgggccgggc 2580 
ccccgtcgac tggagcgtgg actgagtcgg gcgtgcgcgc acaccgccgg cggaggacga 2 64 0 
ggagggggga ggggggtggg atggacggag gagagcggat gatggagccc gcgctcgccg 2700 
gcgccccggc cagcgcgctg ccggtcctgg cggtgctgcg cgagtgggga tgggccgtgg 27 60 
aggaggtcga gccctccggg ccgtgcccgg aggacgcgga cgcgccccgg gagagcgcac 2820 
cccctccccg ggagggggtg cgcgggagcg aagacggaga ggggggcgtg gaagacggcg 28 8 0 
aggaggggaa ggcgacggag aaggaggaga cggaagacga ggaagacggg ggggacgaag 294 0 
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ggacgacgac ggcggcggcg ggcccgcgcc gggcgcagca cgtggagttt gacacgctgt 3000 
ttatggtcgc gtccgtggac gagctcgggc gccggcggct gacggacacg atccgccggg 3060 
acctggccgc ggccctggcc ggcctccccg tcgcctgcac caagacgtcc gcgtttgcgc 3120 
gcggcgcgcg cggcccgcgc ggcgcccccg ggcgcggcca taaaagcctg cagatgttta 3180 
tcctgtgccg cagagcccac gcggcgcgcg tacgcgatca gctccggtcc gcggtgcgcg 3240 
cccgacgccc acgcgagccc cgcgcgcgcc cgacgagcgg acgggcgcgg ccggccgcgc 3300 
cggtgttcat ccacgagttc atcacccccg agccggtgcg gctgcaccgg gacaacgtgt 3360 
ttgcggcgcc atgagcacct tcggacgcgc gtccgtggcc acggtcgatg actaccaccg 3420 
gttcctgcag gccaacgaga cggccgcccg gcgcctggcc gcggcctccc gccgcgtctc 3480 
caccggcggg ggcgagacgc gggccccgcg gtcctcgcgc ggcccccacg acgatgaggc 354 0 
gcccctgcgc gccggcggcc tgggcaccgc ccgcgggcgc tcgcgccagc gcggcgcgac 3600 
cgagccggac cccgtctacg ccaccgtcgt ccagcctacc caccaccacc accagcagca 3660 
ccaccaccgc tctcagcatc cgcagcagca gcaacaacag cagcgggccc cacgccgccg 3720 
cggcagcgtg cacgcctcgg cgacggccgc ggacggaccc gagtcgtgcg cggccgcacc 37 80 
cccgcgccgc cgcggcagcg tgcacgcctc ggcgacggcc gccccggcgg tccagctgcc 38 4 0 
ccggccccgg caacggagca tcaacgcctc gacgacggcc gccccgacgc cccagctgcc 3900 
gagaccccgc cagcgcagcg tcaacgcctc ggcccgcgcc gccgtcccct cgacggccac 3960 
cctcccgcgc ccccggaccc cgtcccgggg ccggcgcgcg ccccccgcct catgctgtta 4 020 
tcgcgatcaa taaagggcga gcgcccacgg accagacaaa agacacaacc ggttcggtct 4080 
ctctgtccgc gcacgcgcgg 4100 

<210> 30 
<211> 38734 
<212> DNA 

<213> Unknown , 
<220> 

<223> Description of Unknown Organism: Unknown 
<400> 30 

gatcctcgtg accgggtaca ccgacgcctc ctggacgccg ctgttcgcca tcgcgggcgg 60 
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ggtcgtcacc gacatcgggt cgatgctctc gcacagttcc atcgtggccc gcgagttcca 120 
cgtcccgtcg gtggtgaaca ccaaggacgc cacccagcgc atcaacaccg gcgacctgat 180 
cgtggtggac ggcgacgcgg gcacggtcga ggtcgtcgag agcgcggaca ccgacccgca 24 0 
gggcccggcc ggggccgccg ggaccccggc cggagccacc accgactgaa gccggccacc 300 
gccgcaacac cggaccacga ccgcccccgc gaggggcgga ccacacccca gacgggagac 3 60 
gacccgatga tccccaacca gtggtatccc atcgtcgagg cgcaggaggt gggcaacgac 420 
aaaccgctcg gtgtgcgccg catgggccag gacctcgtgc tctggcgcga catcgacggc 4 80 
aacctcgtct gccagggcgc ccgctgcccg cacaagggcg ccaacctcgg cgacggccgc 54 0 
atgaagggca acaccatcga atgcccgtac cacggcttcc gctacggagc cgacggtgcc 600 
tgccgggtga tcccggcgat gggctccgag gcccgcatcc ccggctcgct gcgggtaccc 660 

acctacccgg tccgggagca gttcggcctg gtgtggatgt ggtggggcga cgagcgcccg--720 

acggccgacc tgccgccggt ggcggccccg gccgaggtga cggacaaccg gaagctgtac 780 
gccaccaagc gctggacccg cccggtgcac tacacccgtt acatcgagag cctgctcgag 84 0 
ttctaccacg tgacctacgt gcaccgggac cactggttca actacatcg.a ctacctgctc 900 
ctgtacggca ccccgagcaa gttcggcctc gacggccgcg agcggtacct ggccgccacc 960 
cggatcacca accaccgggt ggagacggag gcggaggggc agaccatccg ctactccttc 1020 
gaccactgcc aggaggacga ccccaccaac accacccact acgtcatcac gttcaccttc 108 0 
ccgtgcatgg tgcacgtgca gaccgagcag ttcgagacca cctcctggct ggtgcccatc 114 0 
gacgaccaga acaccgagca catcctgcgc tggtacgagt acgaacaggt caagcccgtc 1200 
ctgaggttcg aaccgctgcg ccgtctgctg ccctgggcgt ccctctacat ggagaagtgg 12 60 
gtgcaggacc cccaggacgt ccgcatcatg gaacaccagg aacccaagat cagcgccggc 1320 
ggcgtgaaca agttcatccc cgtcgacgag atgaacgcca agtacatctc gatgcgcgcc 1380 
aagctgatcg cggacgcctc ggccgcgccc tcgtcaccgg cgcgggcggc ggagcccgag 14 4 0 
ccggaagcgg cggggcgggg cggatcagcg gcccgtgcca cgggcaacgg caggggagcg 1500 
gccggcggac gacgcggcac caagcccaag gaggacgccg ccgcgcgccc gtagacccga 1560 
agacggggga cggacaagag agagcgagag tgagagatgt acggcggata cgacgcgtcg 1620 
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accggcccca aggccctggt gacggccttc 
ttcctgttcg gcggcgcgga caccgtggcc 
gtgaccctgc gccgggtcct gctggcgacc 
gccacgaact tcgtgatgct ccagcgcaag 
atctgggtcc tggtgatcca cggcacgatg 
gtgagcgcgt tcacctggct gggcgtcgtg 
gggtcggagt accagcgcaa actctggaag 
accgaaggcc tgttcaagca ctcgatgcac 
tccgggttcg cgctggtcac gggcaccccg 
tgcatgttcg tcttcctgaa catccccatg 
gaggccttcg acgagtacgc gtcccggacg 
cgcccgtcac gcgcgtacgg cggcctcccc 
accacagatc ccacagatcc ccacagatcc 
agatccacag atcccctcca cagatccgag 
tcggagaggc cgaggcgtcc ctgaccgccg 
ccgaggcgtt cgcccggctg acctccgacg 
cggagctgac cgacgactac ctgctccccg 
gggtcgacct ggagggactg gagaacgtgc 
accactccgg caccctgccg ctcgacgccc 
acagcacgca ccgcaggctc cggctgctcg 
tccgtgacct cgcccgcaag gccggccacg 
tgctcggctc cggcgaactg gtcggcgtga 
ccttcgagga gcgctaccgg ctgcagcgct 
tgcggtcgcg gcgccccatg gtgccgtgct 
tgatcggctc ggcccccacc ctggcccgga 
cgaccttccc gctgctgggc gcgctgggcc 
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aacaccgtcg ccgtggccgg cgccgtgtgg 1680 
gactggttcg gcaccgactt cgacgaggcg 1740 
ctgtcggtgc tctacctgct gcgcttcatc 1800 
atggagtggt cggagtcggc caccatcggg 1860 
gcgtacttcg gcggcaccaa cgacgccggc 1920 
ctgtacctcc tcgggtccta cctgaacacg 1980 
aagcgcccgg agaacaaggg caagctctac 204 0 
atcaactact tcggtgacgc cgtgctcttc 2100 
tgggccttcg ccatccccct gatcatggtc 2160 
ctcgacaagt acctcgccga gcgatacggc 2220 
gcgaagttcg tcccctacgt gtactgaccc 2280 
gggcgagggg ggccgccgta ccgggtggca 2340 
ccacagagcc cctccacaga ccccctccag 24 00 
acgaggcacg tatgaccgga gacattccct 2460 
aggtgctgcg cgaggtcctg gccggcggcg 2520 
agggcgccgt cgacgacttc ggcttcgacc 2580 
ccctgcgcct gctgtacgag aagtacttcc 2640 
cggccgaggg gggcgcactc ctggtcgcca 2700 
tgatgctcca ggtggcgctg cacgaccatc 2760 
ccgccgacct tgccttcgac ctccccgtcg 2820 
tacgcgcctg ccccgagaac gcgctgcggt 2B80 
tgccggaggg ctacaagggg ctcggcaagc 294 0 
tcggccgggg aggcttcgcg gcggtggcac 3000 
cgatcgtcgg cgccgaggag atctacccga 3060 
tgctgaagct gccgtacttc ccgatcaccc 3120 
tgatcccgat gccgaccaag tggaccatcc 3180 
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gcttcggtgc cccgatccac acggacggct tccccgagga cgccgcggag gacccgctgg 3240 

tggtcgagaa gctcgccggc gaggtgaagg acaccatcca gcacacgctc aacgagatgc 3300 

tggagggccg cggctccccg ttcgtctgag ggccgcggct cccggttcgc ccgagggcgg 3360 

cggctcccgg ttcgcccgag gaccgtccct ctcgtccggg gccccgcctc agccccccgc 3420 

cgacgatccc cggcggcaga tgctgcgaac gctggcgaag gccagaacgg cgaggccgac 34 80 

gagcgtgacg ccgccgccga ccagctccgc ggacagatgc atgggatctc cctcaggggg 3540 

acgacggacg gtgatggtca tatagccatg cgaaccccgc cgtccgcccg atccgcagcc 3600 

gcaccgcccc gcgaattcac ccgtagagca gaccggtgcg gccgaggagg ggtggcgatt 3660 

gggtggtcgc gcgttcgaac gcttacgatc ctctgttgtg tccaaactga ccgacgtgcc 3720 

caagcggatc ctcatcgggc gcgcactgcg cagcgaccgg ctgggtgaaa cgctcctgcc 3780 

gaagcgcatc gcgcttcccg tgttcgcgtc cgacccgctg tcctccgtgg cgtacgcgcc 3840 

cggcgaggtg ctgctcgtcc tgtccatcgc gggcgtgtcg gcctaccact tcagcccgtg 3900 

gatcgcggtc gcggtcgtgg tcctgatgtt caccgtggtc gcctcctacc ggcagaacgt 3960 

gcacgcctac • ccgagcggcg gcggcgacta cgaggtggcc accaccaacc tcgggcccaa 4 020 

ggccggt'ctg accgtcgcca gcgccctgct ggtcgactac gtcctgaccg tcgcggtctc 4 080 

catctcctcc ggcatcgaga acctgggctc cgcgatcccc ttcgtcgtcg agcacaaggt 4140 

cctgtgcgcg gtcgccgtga tcctgctgct cacgctgatg aacctgcgcg gggtcaggga 4200 

gtcgggcacc ctgttcgcga ttccgacgta cgtcttcgtc gcgggcgtct tcatcatgat 4260 

cgtgtggggg gcgttccgcg gactggtcct ggacgacacc atgcgtgccc cgaccgcgga 4 320 

ctacgagatc aagccggagc acggcggcct ggccggcttc gcgctgatct tcctcctcct 4 380 

gcgcgccttc tcctccggct gtgccgcgct caccggtgtc gaggcgatct ccaacggcgt 44 40 

cccggccttc cgcaagccca agtccaagaa cgcggggaac accctcgcga tgatgggtct 4 500 

gctggccgtc accatgttct gcggcatcat cgcgctggcc gccgcgaccg acgtgcggat 4 560 

gtcggagaac ccggccaccg acctcttcca caacggcgtc gcggtcggcg cggactacgt 4 620 

ccagcacccg gtgatctcgc aggtcgccga ggcggtcttc ggcgagggca gcttcctgtt 4 680 

catcgtgctg gccgcagcca ccgcgctggt cctcttcctc gccgccaaca ccgcgtacaa 4740 
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cggcttcccg ctgctcggct cgatcctcgc ccaggaccgc tacctgccgc gecagctgca 4 800 
cacccgcggc gaccgcctgg ccttctccaa cggcatcgtg ctcctcgccg gagccgccat 4860 
gctcctggtc gtcgtctacg gcgccgactc gacccggctg atccagctct acatcgtcgg 4 920 
cgtcttcgtg tccttcacgc tcagccagat cggcatggtc cgccactgga accgcaacct 4 980 
ggccggcgag cgggaccagt ccaagcgacg ccacatgatg cgctcccgcg cgatcaacgc 504 0 
cttcggcgcc ttcttcaccg gcctcgtcct ggtggtggtc ctggcgacca agttcacgca 5100 
cggcgcctgg gtcgcgctgc tcggcatgtg catcttcttc gcgaccatga cggcgatccg 5160 
caagcactac gaccgggtcg ccgaggagat cgcggccccg gaggaccccg aggaggcaca 5220 
gagcgacgac atggtgcgcc cctcacgcgt tcactcggtg gtcctgatct ccaagatcca 5280 
ccgccccacg ctccgcgccc tcgcctacgc caagctgatg cgctccgaca gcctggaggc 534 0 
gctcagcgtc aacgtcgacc cggccgagac gaaggcgctg cgcgaggagt gggagcgccg 54 00 
cggcatcgcc gtaccgctga aggtcctgga ctcgccgtac cgcgagatca cccggccggt 54 60 
catcgagtac gtcaagagcc tgcgcaagga gtccccgcgc gacgcggtct cggtgatcat 5520 
ccccgagtac gtggtcggcc actggtacga gcacctgctg cacaaccaga gcgccctgcg 558 0 
cctcaagggc cggctgctgt tcacgccggg cgtcatggtc acgtcggtcc cgtaccagct 564 0 
ggagtcctcc gaggccgcca ggcgccgggc gcgcaagcgc caggactgga gcgcgccggg 5700 
tgcggtgcgg cgcggaccgg cccaccacca ccaggaccgt gaccgtacga aggactcctc 57 60 
ctcgtccacg tagactggac ggctgttgtc cctgtcatcc ccccgttctc tggagtcacc 5820 
ccgccatgca ggcagaaccg aagaagtcgc aggcggaaca gcgagcggtc gcggagccgg 58 8 0 
tctcggagcc ggtctcgctg gtgggcgagg agtacgaggt cgaggtcggc cccgtcgccc 594 0 
acggcggcca ctgcatcgcc cgcacgtccg agggccaggt gctgttcgtc cggcacacgc 6000 
tgcccggcga gcgggtcgtg gcccgggtga cggagggcga ggagggtgcc cgcttcctgc 6060 
gggcggacgc ggtcgagatc ctggacccct ccaaggaccg catcgaagcc ccctgcccct 6120 
tcgccggccc cggccgctgc ggcggctgcg actggcagca cgccaagccg ggcgcccagc 6180 
gacgcctgaa gggcgaggtg gtcgccgagc agttgcagcg cctggcgggt ctcaccccgg 624 0 
aggaggccgg ctgggacggc acggtgatgc cggccgaggg cgacaagctg ccggccggcc 6300 
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aggtcccgtc gtggcgcacg cgcgtgcagt tcgcggtgga cgccgacggt cgcgccggtc 6360 
. tgcgccgcca ccgctcccac gagatcgagc cgatcgacca ctgcatgatc gcggcggagg 6420 
gcgtcagcga actgggcatc gagcgccgtg actggcccgg catggcgacg gtcgaggcga 64 80 
tcgcggcgac gggctcccag gaccgccagg tcatcctgac cccgcgcccc ggcgcccgcc 654 0 
tccccatcgt cgaactggac cgcccggtct cggtcatgcg cgtcggggag aaggacggcg 6600 
gcgtccaccg cgtccacggc cgccccttcg tccgcgagcg cgccgacgac cgcacctacc 6660 
gcgtcggctc cggcggcttc tggcaggtcc acccgaaggc cgccgacacc ctggtcaccg 6720 
cggtcatgca gggcctgctg ccccgcaagg gcgacatggc cctggacctc tactgcggcg 6780 
tcggcctctt cgccggcgcc ctggccgacc gcgtcgggga ccagggagcg gtcctcggca 68 4 0 
tcgagtccgg caagcgcgcc gtcgaggacg cccgccacaa cctcgccgcc ttcgaccgcg 6900 
tccgcatcga gcagggcaag gtcgagtccg tcctgccccg caccggcatc -gacgaggtcg 6960 
acctcatcgt cctcgacccg ccccgcgccg gcgccggccg caagacggtc cagcacctct 7020 
cgaccctggg cgcccgcagg atcgcctacg tggcctgcga cccggccgcg ctggcccggg 7080 
acctggggta cttccaggac ggggggtacc gggtgcggac gctgcgggtg ttcgatctgt 714 0 
tcccgatgac tgcgcacgtt gagtgcgtgg cgattttgga gcccgccgca aaggggctct 7200 
gacctgcatt tttcttggct ggatcaggag cggcctgttg cgctcgacct gttctccaaa 7260 
gcgcacgacg tagagcttgc ggaccgctcg tgaaagccgc ctgacctggc gttgcacgag 7320 
cggtgccgcg atgtcggcgt ggtcggccct tctcctggcg cgaagggaaa ccgaaggtct 7380 
tgacgctcgg gtgacgctat ttctgaaggg tcgtcaccga ctggggaggc agggccctgc 744 0 
ctctcgcgcc cgatgaagca ggttctctct gctccaggta atcgtcgagg gtgccctgac 7500 
ggatcaggta gacggtcagg gaccgcaggg tgcagggcgt cgacgcccag gtcgaggatc 75 60 
atcagggcgc tgtcattggt gatggcgaag gcgccgatca caccgtcgac ggaacaggtt 7620 
gcgttcagga gtctgtgcgg gcgccgcacc gg'cacggtct ggaagctcgg ctccgaccgc 7680 
agctcgccac cagtccgaga ggagccgata ctgtccggtg ccgggtgacc cttcgtgcaa 7740 
gcgttgctgc ccccgctcgg cagaccgggg cagcaacgct tgcacgatcg gccggtactc 7800 
aacgggatcg tgtggagttt cggaccggaa cggcttggca ggacgtgccc gagcggtacg 7860 
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gctcctgggc cacattgcac acccgcttcc gtcgatgggt gaaaggcggc acctttcagt 7920 

gaaagggggt tccgcccccc cccgggacct tgcgcccacc gtcgccgacc ggctgatgaa 7980 

ccggctccgc gctcccgcca ccaacctgac ccgacgtgag accgaagtcc tctcaccggt 804 0 

cgccgacgga ctgtccgacc aggccatcgg cgcacgcctc cacttgaccg aaggcaccgt 8100 

cgatatcacc tggcctgcat ctatgccaac ctcggaaccg actcgcgcac cgccgctgtg 8160 

gccactgtca ccgccatcga cgacctcggg ctcatccgcc gctgaacagt atgtggtggg 8220 

cggtgtttcc gttctccacg acttcagcgg cgtccggagt tgtggtgctg gctgggcttg 8280 

gtgcccgctc ctctgaaccc atgtgaacgc ccacggccag ttcgagccgg acacgccccc 834 0 

cggacctggc ctccgccgcg gccagaatgc ccggcccctg cacctggtct gcctacctgt 8400 

acaggcgagg gcggtccctc ggagccactg cctgtaactc cgaggggccg cccttggccg 84 60 

actcggcggt caccgcggac gcggtgcggt caggaggcac cgtctgcgtc atcaggcgcg 8520 

gctcggccga gcgagttatt ccggcacccg tgggaccaga aatgtcagcc ctgcgtgacc 8580 

gcttcgaaga ccgtgacgcg gttgtcgtcg gagagcgagt gggtggggtc ggcgtgcgcg 8 64 0 

gcgcggaatg cctccgacga ggtgtaggcg gtgaatgcgg cgtcgtcctc gaagttgagg 8700 

acggccaggt agccgtgtgc gcccttgcgg ggacgcagca gccgtgcgtt gcgcagcccc 87 60 

ggcacgttgg aaagggtzggc gcgcatgctg gcggtgcagt tgttctcgaa cgcgccctgg 8820 

gcgggcgcgg cgacggtgaa ctcggtgacg gcggtgctca tttctgtctt ctctcggttg 8880 

ttggtgtgat gtcggtggct gtcccgcccg gccggggccc gcacggcgat ggcgatgatg 8940 

tgcaccgcgt gtccgatcga gttccgttgc ggggtgcggt tacagggctg gagttgggct 9000 

cgggtccgcg gtcggctgag ggagcctgcc tgtgcggcgg gcccagattt cgaacgcgat 9060 

agtcatgaac gggggtacgg cggccagcag ggcgaagatc gttgtccggc ccagccgcca 9120 

cttcagccgg atcgcgacca ggacggtcag ggacacgtag acgatgaagg cggcgccgtg 9180 

gagggtgccg aagatccgta cgccgagttc ggtggtttcg gggatgtact tgaggtacat 9240 

cccggccagc agacctgccc acgtgcacgc ttcgatgatc gcgatccagg tgaacgcgcg 9300 

cagcagacgg ctggtgccgg tttcggcagc ccgtgcggcc ggcgcgtcgg cggaaggcgg 9360 

ttcggaggtg ggcgtggagg gtgtctgcgg ggtgcctggg cgtgcgggcc acagggcgcg 94 20 
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gttgccgagc aggcggacga tggcggggac caggagcggg cggatgagga aggtgtccag 94 80 
caggatgccg caggccatgg cgaagccgaa ctggaacagt tcgcggatcg gctgggtcat 954 0 
caggacggcg aaggtcgccg cgaggatgag gcccgcggag gagatgacgc cgccggtgcg 9600 
tgtcagtgcg gcggtgatcg ccttcgctgg gggctgggtg cgcagttcct gcttgaaccg 9660 
gctcatgatg aagatgttgt agtcgacgcc gagcgcgacg aggaagacga agatgtacgc 9720 
ggtgacgcgg ttgccgatgc cgtcgtcacc gaggacggtc acggtgaaga aggtggtggc 9780 
gcccagggtg gccaggaacg acaggagcag ggtcgcgacc aggtagagcg gggcaaggag 984 0 
cgagcggagc agcaggacga ggaccacggt gacgatggct aggaccagca gcacgatgag 9900 
ggtcgtgtcg cggtcgaggg cggagcggat gtcggcgttc tgcgcggtct cgccgccgat 9960 
gagcaccgtg gcgtcctgga cgccggcggc ctgggctgcg gattgtgtgg cctgcttgag 10020 
gggaccgatc gcgtcgagtg cettggagct gtaggggtcg,. aggtcgagga tgacgtcgta 10080 
gaagacggtc ttgccgtcct tgcccatgcg ggggtctgcg acacggctga cgtgatcggc 10140 
gtcggtgagc gcggtggcga tgtcggcggg tgcggggctg gagcgcaggt tgtcctggga 10200 
atggacgacg acggtactgg gggcgatctc gccgggcccg aattcctccc gaatgaggtg 10260 
ctgtccgtgc tccgactcgg tggcggcgcg gaagccgctg agggtgttga agctctcctg 10320 
gtagccgagc agtcccgcgc tcagtaccac caggagtgcg atcacggccg aggccacctt 10380 
gacgggggcc cgtgcgacca gggcggcgat gcggtgccag atgcctgcgc cgcgactgcg 104 4 0 
ttcggcggcc ttgtccacgc ccccgggcca gaagacgctc ctgcccagca ggaggaccag 10500 
ggcggggatg aaggtgaacg ccaccagcgc catgacggcc acgcccagag cgaggtacgg 10560 
tccgaagccg tgaagtgccg gggagacggc cacgagcagg gcaaacatgg cgagcacgat 10620 
ggtcgaggcg ctggcgagga cggactcggc ggtgcggcgc acggcggcct gcatcgcgcg 1068 0 
ggcgcggtct ggctcgtcga gcagggtctc gcggtagcgg gcggtgatga tcagcgcgta 10740 
gtccgtgccc accccgaaca gcagcacggt catgatcgag gcggtctggg' agctgaccgt 10800 
gatgactccg gcgtccgcga gaatcgcgcc gagagtctcc gccacgcgca tagccacgcc 10860 
cacggcaaga agcggcacga gcgccatcag gggcgagcgg tagatcgcca gcaggatgat 10920 
caggacgagc acgacggtgg ccagcagcag gactttgtca ccgccgctga agaccttcac 10980 
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ggtgtcggtg gcgatcccgg cggggccggt caccgcgacg tcggcgggcc cggcccggtc 1104 0 
ggacgcgagg gcacgcacct cgtcgaccgc attctggaag gactcgtccg aggggctgcc 11100 
ctccatgggc acgatgacca gctgagcacc gcggtcctgc gagaccaact cggccgcagc 11160 
gtcgggagcg gtcaccgtgg agaccacgct cacgacatgg tcgggtcggc tggttccgga 11220 
aagggccgag gtgatggcgg cgaccgattg cgtggcgctc ttcgcggcgt cggtgccctt 11280 
gccgcggacc acgatgatcg ccggcgtcgc gtcctggccc ggaagctggg cgcggacgag 1134 0 
atcacgggcc ttcatggagt ccgaggcggc gggcggcagg ttggcggagg cgttgtcctc 11400 
gacggattcc agggccgggg cgaccccggc gaggaggccc gcgatcagga cccagaaggc 114 60 
caccaccacg gcggcgcgct tcttcgatcc caggagacat cgcagcagag cgggggagtt 11520 
catcggttgc atcgggcagc cttcggcagg aagtacggac agaacttagc gacagggtgt 11580 
ctctaagttg cgtcaagcta acacgccccc tcggcctctc gggcgtgggg gtaggttggc 1164 0 
gggagacggc acagcgtccg aggtgaagcg gagaaaatgc ccaagattga agccggcagc 11700 
gtccgggagc accgggcgca gcggctcgcg cagctgattg acgcggccga ggagctcctg 11760 
gaagagggcg gtgccgaagc cctcacagcc ggagcggttg ccgcgcgagc cgggatcgcc 11820 
cgcaacagca tctaccgcta cttcaactcc atcgacgacc tgctcgaact cgtcgtcacc 118 80 
cgcgaattcc ccgcctggat cgacgcagtg gagcaggcca tcgcggccga gaccacaccc 1194 0 
gccgcccagg ctgccgccta cgtcagggcc aacctcgaac aggcagctcg cggcacccac 12000 
ggctggcggg ccgcgctcac gcgcgactcg ctctccccgt cggcgcggga gcgggtgagg 12060 
aatctgcaca tctcgctaca cgaggcgctc gcccgggtcg tgcgcgaact ggggcagcca 12120 
cagcccgagc tgaccgtggc ggtggtccaa gcagtcgtcg atgcgtgcat ccgcagaatc 12180 
gaccaaggcg acgatctgac aaccgtgtcc gacttcgcgg ccggagcgac gcgtcgactg 12240 
ctcgcggatg acgacttgcc acatcacccg tgacgcaccc cgtccaggcg gctcgcaggc 12300 
ccgtcgacag cgaagccccg gcagaacgag ccggatcttg agccgcaccg gagcgtgacg 12360 
cagaccgctg gtggctcatg cctcgtctca tccgatcttg ccaccgggcg gccgaccggt 12420 
cagtgcccga cgcccatcga ttacgacgtc cacgacccga accagcgcgt tcagtgcgtt 124 80 
gacgttcgtg gtgcgctcat tggtcacccg gcctctgggg gtcaccagcg cttttagggc 1254 0 
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acgagactcg 
tccggcttcc 
gcaagcagga 
tcgatgacgt 
gagcggcgtg 
tgccgagcac 
gaacgtcttc 
tccgtcgccc 
cgcgtcttcc 
cccgacggac 
ctgggaccgg 
ttgtgggtct 
ctgcgtgtcg 
ggtcttggcg 
tcgcggagcg 
ccggcacccc 
atccaggtca 
cccacgtcct 
tcgcgggcga 
aagcgctgac 
gtgccgtccg 
cagccgacgg 
tcatatcctc 
gcagcctcct 
atttcctcac 
ttgtcgaaga 



acggtggcgc 
cgacgacctg 
acttgccatc 
tctggccggt 
cgctacctcc 
agcgtgcgtg 
ggagccgagc 
gcaatcacgc 
tgcggttcaa 
ccgaggcccg 
cgctgctgat 
cggtggtcag 
gcgccgggcg 
tggtcgctgt 
catacggtgg 
ggcgaacagc 
gtcgccacag 
ccccggacac 
cgatccgctg 
tgctcgtgat 
tgccgacgag 
cgtaggcgtc 
tgagggtcac 
cgtgggctga 
cggcctcgac 
aggaggtggt 



gtgataccag 
gacacgaagc 
ggggccattc 
ctgatggaca 
agatcgacga 
acatgggcct 
tgtatcccac 
gctgatcgac 
cggcgccagc 
tcacgcgctg 
cgccctgtcc 
tgaggtggcg 
tctgggagcc 
caaactcatc 
cagaccggtc 
cgcacgtccc 
gcgcagaagc 
cacgaagtcc 
aagggcttcg 
cagccggtcg 
aggaccgccg 
gagctctccc 
cttgaacagc 
tccgcgtccc 
gaggaggcgg 
gtggcccgtg 
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gcaggcatca 
ttacggagcg 
gtgatgcccg 
gcgatgcgga 
gatcctggcc 
ccttgtgtcg 
cctgcacgag 
ggcaacaagc 
gccagtaccg 
ctcaccagca 
gccctggggg 
gcgccggccc 
ctgatcgccg 
gccgggcact 
ggaatcctcg 
cgtccggtcg 
ccggtgccgt 
ttggcctgcc 
aggtcggctt 
gcggccttgc 
aaggtgtcgg 
tcggcccagc 
ccgtcggcct 
gccgcgtaac 
cgtgcctttt 
atgctcgtcg 



tgaccttatg 
ggctcgtggg 
ggaccgggcc 
gattctggac 
atcgtgcgca 
gcgatcgaac 
aagccgcggc 
gcaccgcctg 
tcccgcccca 
gccctctcct 
ttctcgccct 
ggtggggctt 
gcgtactcaa 
gagagggcct 
gcgccgcggc 
ggtcaggtcc 
ccaccgcgta 
acagcgggac 
cggcgtcgct 
tgccgtaccc 
gatcggggta 
tcttctggaa 
ctagttgctt 
cgtaggtgaa 
cggcgtcctt 
ggatgaggga 



gcgatgacac 
gagggttgca 
gagctgaagg 
tacctgaagt 
cggtcaacgg 
ggccccggac 
actactgcac 
gttcgccatg 
cgggcgccgg 
cagcagcgca 
ggacacggcg 
cgtgggcggg 
cgccgtgatc 
gtggtggtgt 
cggagcggtc 
gagccgtcag 
ctggccgccg 
gacgggcacg 
ccggtcggcg 
cgtcgccatg 
gtcggcgacc 
ttcgtcccat 
tttcacctcc 
agacagcggg 
gtgagggtag 
gtagagcggg 



12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040' 
14100 
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tccacggttc cgtcgtagac gtcgtaggaa atccggtccc tgtctatcag ccaggccgcc 14160 

gcctgccgtg cgcgtctgtc gtgaaacggc ttgccgcggc ggttgttgag gtacaggttt 14220 

cgagtctccg cgctctgcgc ctccgtcacg cgaagccccg gatcgctcgg gttcagatcg 14280 

gcgagcattt cggggggaag ctgtctgagg gcgacatcga tgcggtggga tatccaggcc 14 34 0 

cgggcgagtg agtcgggggt gtcgtagaag tggagttcga tcggccggcc ggtgttctcg 14 400 

gcggcgccct tgtaccgagg gttgggcgag agggagatct tctcgccctt cgcgtaggag 14 4 60 

acgacgccgt acggtccggt cccgtcgatc cggccgtccg agcgcaggga gtccgccggg 14 520 

tacgtggtcg agtcgacgat cgagcccgcc ccggtcgtca gcttgaaggg gaacgtggcg 14 580 

tccggtgcgg tcagtcggaa agtgacggtc cggtcgcggg cgtccatcga ctcgatggtg 14 64 0 

tccaggaggg acgacggccc cacgtcggaa tctatcttct tgacccgttc gaacgagaac 14700 

cggacgtcct tggctgtcat tctgcgtccg ctggagaagg tgatgtcatc ccgcagccgg 14 760 

catcgatagg tgcgtaggcc ggaatcggtg aaggagcagc tttcggctgc gtcgggaacg 14 820 

ggctccgcca ctccgggctc cagggtcagc agtgtctgga agacattgct gtacagagtg 14 880 

gtcgagccgg agtcg-tagcc gccggccggg tcgagtgacg tcggcggttc cgtcgtcccg 14 940 

accttgatgg tggtgccctc ctggtcgtcc gtcgggtaaa gcagcagacc ggctgccagt 15000 

gccgtggcga tcaccgtggg tgtgatgacg gacgcacgga tgtgcgcacg aataggtctc 15060 

atgaggctcg tcctcgcaag atcgagacga acaggaattt tcgtacccct gggtggagag 15120 

tgcgtcggcc aagtatgcgc aggcgtcgct tccttcggag cccgacggca cttccggaac 15180 

gaagtcttat gactgacacg gtggaactgc tatgccccgt tcggcgagag ggccgccagg 1524 0 

ggtcggcacc ccctctcagc agccgttccg cctcgtctcc ggtggtcctg cggacccgct 15300 

tgcgcgggtc ccgcccacgg tctcactcct cgatgccatt ccctgtgcaa tgtcacctgt 15360 

gccatgttcc gtgttgcagg gcgtggccat gccaagtcgg gaggtcgttc gtcttccgtc 15420 

aggtggcagt gcggtactcc gtttcccacg tcctctcccc cttcagtcgg ccgtgctccg . 154 80 . 

cacggccgga tccctcatgg gaggcgctgt gagaaagtca ctggtacggc gaggtctggg 15540 

ggcggcgctg ccgctggccc tgaccgtegc catgagcgtg ggcctgctgt cgcagccggc 15600 

cggcgcagcc gggaacaccg ggtccgtcgt gcacgtcgcg gcggacgacc cggagcacgc 15660 
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gggacccccg 
cacgagggcg 
cggcaagacc 
caagaccggc 
gagcggcggc 
gttcaacctg 
cgcctacgcc 
gcagctcgtg 
gggcacctac 
cagcccgcac 
gctcatcgac 
ggactacgac 
cacggtgacc 
cggcctgatc 
ccagtcctac 
actgcgctcc 
ccggacggcc 
ctgctcctac 
gacctaccca 
gtcctccagc 
caacggcccc 
caccgactac 
gtacctggag 
cgagtggctg 
cgacggccgc 
gtgggtcgaa 



cccgtcgcgc 
tccgagcttc 
gcgaaggcgc 
aactcccgtg 
gcgctggtcc 
accgggaacg 
ctgcgcgacg 
ctctacctgc 
ggcagcagcc 
tcccgcgacg 
agcgccgagg 
tccacctgga 
ttccgcggtc 
gacgcgctct 
ctcacgtcga 
aaggtcagcc 
ccgctgtggg 
tacggcacct 
tgcagctcca 
tgcagcagcc 
gtcgcgaacg 
cagacctacg 
gggaatccgt 
cgtccggact 
ttcgacatgt 
ggcttcgccg 



agtcccccac 
cgcccgtggc 
cggtccgtcc 
cgcgtgccgc 
agcagatcaa 
acgcctacta 
gctcgacgtc 
gcgccggcta 
tgcagaccgc 
tcaacgacgc 
agaacgcccg 
actcgtcgtg 
accaggtgcc 
acaacttcgc 
acgcgggacg 
ctctggccgg 
tcggtgtcgc 
gcgacctcca 
gcatcaccat 
tgcgcaacca 
acaacaacag 
ccggcgcgat 
cggcggccgg 
tccagatctg 
acggcgactt 
agtacgtctc 
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cgccgagacg 
cgcgagtaag 
ctcgaagtcg 
cgcgtgcaac 
ggcgtccacg 
cgccttccgt 
ctacccgggc 
ctacgtgcac 
gatacgcgcc 
caacggcgag 
ctacatccac 
gtggatgctc 
cgcgttcgtg 
gagcggccac 
tgaactcggc 
cggcctgctc 
cgagatgacc 
ggcacaactg 
caaggcgcag 
ggacgcctac 
caccatcgag 
gtacgggatc 
caaccagccg 
gaacctcaac 
caacgccaac 
ctactcctac 



gagcacgtcg 
gacgcgctca 
acggacaagg 
gtctccgact 
accgactgcg 
gagtcgcaga 
aacgcctcca 
tactacaacg 
gggatcgacg 
acgctcgccg 
gtcgtcaagc 
aacgcggtca 
agtgccgtgc 
ctcgcgctgc 
cggttcctgc 
aactccagct 
gactactacg 
gcccgctccg 
cagatgacct 
ttccacaacg 
gtcgtggtct 
gacaccaaca 
cgcttcatcg 
cacgagtaca 
atcaccaccc 
cgcggcgtcc 



cgcagggacg 
aggaggtgta 
cggtcgccgg 
tcaccagccg 
tcaacaccct 
tgacctcggt 
ccggtatgcc 
ccggcacggt 
ccttcttcgc 
aggccgtcac 
gactgctggc 
acaacgtgta 
agtctgaccc 
tgggaacgga 
agcattccgc 
ccatcaaggg 
acaaggccaa 
tcctgacggt 
cgggcgagct 
tggtccgtga 
tcgactccag 
acggcggcat 
cctacgaggc 
cccactacct 
cgaccatctg 
cctacaccga 



15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
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ggccacgacc gaggcggggc gtcgcacgta cgcgctgagc accctgttcg acaccacgta 17280 
cagccacgac accacgcgca tctaccgctg gggctacctc gccgtgcggt acatgctcga 1734 0 
aaaccaccgc gccgacatgg acaccgtcct cagccactac cgcgcgggaa actggaacgc 174 00 
cgcccgcagc tacctgaccg gcaccatcgg cacccgctac gacaacgact ggtacacctg 174 60 
gctggcggcc tgcgcggccg gcaactgcgg tggcgggggc accaacccgc ccgggaacca 17520 
ggcgcccacc gccgcgttca ccaccgccgt ccagggcctg aacgtcacct tcaccgacca 17580 
gtccaccgac gccgacggca ccatcgcctc ccgctcctgg agcttcggcg acggcaccac 17 640 
ctccacggcc accaaccccg tcaagacgta cgggtcggcc gggtcctaca cggtcaagct 17700 
gaccgtcacc gacgacaagg gagccaccgc caccgccacg aggacggtca ccgtcggcag 177 60 
cggcggaggc ggcggcaccg aatgcaacgg gaccgacacc cgggaactgg gccagaactg 17820 
ccaacgcggc aaccagtccg ccaccaccgg caactacgcc tacctgtacc tctacgtccc 17880 
ggccggcacc acccagctga agatcaccac ctccggcggg acgggcgacg cggacctgta 17 940 
ctacagcacc agcggctggc ccggcaccac gagctacacg cagcgggcca cgggagccgg 18000 
caacaaccac accctgacca tcaccaaccc gccggccggc gccaactaca tcagcctgca 18060 
cgccgtcagc agcttcagcg gcgtcaccgt gagttccgcc tactgaccca cggctccgca 18120 
ccaaggcacg accctcacga cggcccgggg cggctctccc cgccccgggc ggcgtccggg 18180 
gcggcggcag gggggagacc tccgtcgccc cggaccgaga acacatcgcc cgcccgcaca 182 4 0 
cgggcatccc tacctcccag gaggcagagc gtgaagtcat tacccgcacg caggcgacgc 18300 
cgcgccatgt ggtccctcat catgtccgtc ggtctcacct gcgcactcgc cacacccgcc 183 60 
gtcggcagcg gtgaccaggg cacgtcacgg ctcagcgcct cgcaacaggc cgcggccggc 18 4 20 
caactcgcag cggaccagca catctccacc " caggaggcac agcggcgcgt actgcggcag 184 80 
gagcggctca ccggcgtcgc aacagcgctg cgtgagcgcc tgggttcccg cttcgcagga 1854 0 
gcctggatcg accagaagca cggcggcagg ctgaccgtcg ccgtcacccg gtcgacggcc 18 600 
acggccctcg tcgaggcccg gtccgctcag gctcaggcac ccgacacgac caccgtcgtc 18660 
gtcgaccgca gcctgcggca actcgaccgc atgtccgcag gactggccca ccgtatcgcc 18720 
gcagcgaaca agggcgccgc ccacggcctg cagtccgcgg tggtggtgca ggacaacaag 187 80 
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gttcgtctgg acctgccacg gggcaagacc ctcacccccg cccagcacgc agtcgtggag 188 40 
tgggcgaagc ggaccctcgg cgatggcctc gaggtcagca cctacgcgca tgcctccgaa 18 900 
cccttctact gcggcggcca gtactcgtgc gaccccccgc tgcgctcggg cctggccatc 18 960 
tacggcacga acgtccgctg ctccagcgcc ttcatggcgt acagcggcag cagctactac 19020 
atgatgaccg ccggccactg tgcggaggac agctcgtact gggaggtccc cacctacagc 19080 
tacggctacc agggggtcgg tcacgtcgcc gactacacct tcggctacta cggcgactcc 1914 0 
gcgatcgtca gggtcgacga ccccggcttc tggcagccgc gcggctgggt ctacccctcg 19200 
acccgcatca ccaactggga ctacgactac gtcggccagt acgtgtgcaa gcagggctcc 192 60 
acgaccggct acacctgcgg gcagatcacc gagaccaacg caacggtgtc ctacccaggc 19320 
cgcaccctga ccggcatgac ctggtccacc gcatgcgacg ctcccggtga cagcggcagc 19380 
ggcgtctacg acggctcaac ggcccacggc atcctcagcg gggggccgaa cagcggatgc 19440 
ggcatgatcc acgaaccgat cagccgagca ctggcggacc gcggggtcac gctgctggcc 19500 
ggctaagcag cccgggcgga ccgtgagtac gccgccccgg tcacatcacg aggacgtcga 19560 
ccgccgcacg cgcggtcggc gtctttcccc gtgctccgct ccgtccgcca cccagcggac 19620 
tgggggcggg ggcgtggcac gtcgtgcacg ccgcagcgcg gtggaacccg tcggccgatt 19680 
agaccgtacc ggggagcgcc tttccggctc cgttcgtggg acgggcgggt gcgtatgcgc 19740 
gcgtcaccca tttctggaag tgcggagcct gcgacagcag ttgccagtgg gcgcgtacgg 19800 
catgatggtg caccacctcg acggccgacg cctcgaccga atcccgccgc cagacgagca 198 60 
gatgccgctg ccacagcgga tcccccgcga ggggtttaac cagtactccg cccaccgggc 19920 
gcatggtggg ctggacggcg gccaccccca gacccttggc gatcatcgac tgcagttggt 19980 
cgagcatgtg gaactcgtgg gtgacggcgg gcctgaatcc cgcggcccca caagcgtcgt 20040 
agaaggcgcc gggccagccc accccgtcgt ccgcggagac gaaccacgcg tcctccgaca 20100 
ggtcggccaa ggacacctcc agccggtgcg ccagtgggtg atcggcaggg gtggccacga 20160 
acaccgggac ggttctgata gctcggtggt ccagcttcgg agagtgtcga agaggcagcc 20220 
ctgggtagtc gcaacccagg gcgacgtcga gctcgccggc ctctaggaga tcgatgagtt 20280 
ctccggtcgc gtacacactg ctgaccgaga cggtcagatc ggggcaggct tcacggagga 20340 
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cgtcgagcaa ggtgggtacc accggtgtgt tgatggcccc gaggcgaagc cgacgtgtcg 20400 
ccccggacga gcggggaggc cgcagccgtg cgagattgtc ggagagcgcc aggatctccc 204 60 
gggcccggcc gacgacctgg gcgccgtagg cggtgagctc cacgcccgcg ctgctgcgca 20520 
ggaagacccc ctcgacgagc agtccctcga tgcggcgcag ttgggtactc atcgccggct 20580 
gggtgtatcc gagcgccgca gcagcccggc cgacgccccc cgcgtcggct atcgcacaca 2064 0 
gcacgcgcaa gtggcgcagc tcaagttcca cgggggcacc tcgctccggg cgaacagagt 20700 
tccattatgc gccaggagga aggcggtggg gaatccggga cggcctgacg ccttcggtcg 20760 
accagtagcc cgagggttat ggatgagccg gagcctctgg tatggcctgg ccggttgttc 20820 
ccgggtgacc gccgtggaaa tctcggacct gcgtgttggt ccgcagaggc gactgcggaa 20880 
gcctgaagcg caccgccatc gaggagcgac atcatgcctc acacctgcat cagcttcacc 2 0 940 
gtcgaagcga ccggggccgc ggttcaccgc gcccgccacc gcgtctccac cgcgctgagc 21000 
tggtggggag ggccggtcga ggaagagctc cgcttcagcg cggaactcgt gacctccgag"2 10 60 
ctcctcacca acgggctgcg gcacgcgggc gggcccatga ccgtcgagtt gacgctggtg 21120 
cacgacatgg tcgtcgtcgc ggtcctcgat gacagccggg agctgccgcg gcctcggcag 21180 
acggaggcgg acgacgagtg cgggcgggga ctcgccctga tcgaggacct cagtctgata 212 4 0 
cggggagtcg agaccacttc ccgcgggaag cgctgctggg cggttctgcc gctgcggacg 21300 
ccacaggagc gggctatcga gtcggctccg gctgaggagg cggaccacgg cttcgaggca 21360 
gaccgggaac gctggtcact ggctccccaa ggaagcggac tactggcgag tctgtttccg 21420 
gcgatgtgag ttcgtcctcc tcgggcggcc cagtagccga cccagggcag gcgggcgtgc 214 80 
ctgagggcgt gatgacgctc gtctgacgct ctggccgctt tcaagctgca cagcgagccg 21540 
agaaacagcc tttgacctgg ccttttctgc ggctgcctca ggccgacatc tttccgatga 21600 
cgcaccacgt ggagtacgtg gcgattctgg agcctgctgg caaggggttc tgacctgcgc 21660 
ttttgttctc ctgcggcggg cgcggcaagc tcgtgcgggg cagttgggtt * tcccgaaggc 21720 
cggtgctcgt gtgtccggcc ggcgggtggg ctgccttcgt ttcagtgggt gcgagagggc 21780 
actcggacgc ctgagccgag atgcggttcg ttcggcacca tggggtccgc aggatgaccc 21840 
ggtcagcgac cgctggcacc tgtggaagaa cctttgcgac aaggccctgg ccgaggttcg 21900 
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ctcccacagc 
gaccacccgc 
cgaatgcgcc 
cgatcctgaa 
caccggcagc 
cgtcggaggg 
ccgtctgccc 
atcactgttc 
tagacattgg 
ggcagagacg 
gtcgaagcca 
gtggtcaggg 
ggtgaaggcg 
agagcagacg 
ctgagaggtt 
cgccatcagg 
ggtggtctcc 
gcggacacag 
gccctggtgc 
agcagacgat 
ggcggccgag 
gatcacgccg 
cgcggcccgc 
cgtcatcctg 
cccggccggc 
agggctcgcc 



gcctgctgga 
gaacgttggc 
cgccgcctga 
gccctgcgcc 
ttcaccctgc 
tcgcggcttg 
ggggcttggg 
gagagtgatt 
gcagcagtgt 
aactgccggg 
ggatggtgca 
gccgccaccg 
tcggctgcgg 
caggacgggc 
cgtgtcacca 
atcgcccggg 
ggtcaagaaa 
aaggccggtg 
cgcatggcac 
tcgtacggcc 
atgctcggta 
ctccgctcgg 
gcccgcgaac 
gaagaccagc 
ctggtggaca 
tgccaagacg 



ccacagcgaa 
atcagctcca 
acctgtccct 
ccgtgaagca 
tctaccgcag 
acgctcaccg 
tcccggctgc 
acctctccgc 
ggttatgttt 
cagcagtacc 
ggacggcgac 
cagtgcagta 
acgcgcgcgc 
aacggggccg 
gcagtagagc 
cgcagttttg 
ccgcgatccc 
cagtatcagg 
cagggcccct 
gtctcgacga 
cgacccccgc 
agggcggcca 
tcgtcgacca 
tccaagaagc 
aggccgaggg 
ggaagccctt 
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cacaccccgc 
cgacctcctc 
caacaccgtc 
gctgtttcgc 
cacccagggc 
tatccatcac 
ggcccgtagg 
cggacacatg 
ctcctgtaac 
cgcagttgca 
gggactgacg 
cccagcagcg 
cgggaggttc 
actgtcggac 
agtaccagag 
ggcccgggta 
cgcgcccccg 
gccggcagat 
ccatgcgttc 
cgacgattac 
tttcctgcgg 
ccgccgctac 
gggcactccc 
gcggcgtatc 
ctgaggccgc 
gccgcaacga 



ccggtcggcg 
ggcaagggtg 
aagcgctacc 
gaggt ccagg 
cgggcagaag 
tggaacggcg 
cccggctcac 
gaaatctgca 
ccagaaggac 
ggacggtgcg 
accggaccgg 
aagtcagtga 
ggcagtggtg 
agtggcgctg 
gaaagaacgg 
ccgcaggaca 
gcagcaggca 
ggt gtaggag 
cgcagagagg 
cccgcctaca 
gccgtcggag 
tcccgctacc 
gt cgaggcgg 
aacgaggaac 
atctgccggc 
gaagaggcaa 



tccatgagca 
tcggcttgct 
cgcgcacccg 
agcagggctg 
gcgaccggcc 
acgtctgatc 
cccagcaccc 
tcggctggag 
cgcagggccc 
gtggtggagt 
gcggcccgca 
gcggtacctc 
gttccaagcc 
tcacaggtca 
aggaaccaag 
tcgatagtga 
ggtcgggtcc 
ttccttcggg 
tgcagatgac 
ccatggggcg 
aagcccggct 
agttgcgcat 
cctgccgcat 
tgcagaggcg 
cggtcctgtg 
ctgtccgcac 



21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
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cgatgtgctg ggcccggtcc tggctaggac tcccgtcttc ttgccggagc gatgcggctg 23520 
tggacgcgga accggacggc agtgtcgtcg ggcgcggagc gcggggcgca cgtcgatggc 23580 
gacaggaccg gcgaaggtgt attcgtgttc ggcggtgtga cggcgcacct ggccggcgag 2364 0 
ggcggcggcg caggtgtcac agggacatcg gttccgactt ccaccacccg tccgggttcc 23700 
accagcgtgt catccacctg atccaggctg ccgcggtagg tgctcgacgt cgggggtgta 237 60 
cgggggcagt tgtaccgttc cgccgcgagg agtgacccga ttgaccaccg gcctgtggcg 23820 
ctcaggaacg ggctggactg tcgcagtccg ggccaactca agcccgacca tgaggccgac 23880 
cacggcgccg cgcgaccccg accacagcta cacgcgtggc atgaccaagg cggcacatgc 2394 0 
ttcgaacgag ccatctcatg tgtgccggta tgaacgtgat cgacgtcccc. ggcactctgg 24 000 
tgcggacgca agccgtctgg ggcgccaccc acgactggct cgccgccccg cccccgcggc 24060 
gccaccgtcc gtcccgccct gcgtcgtcgc ccgtgtggcg tcatgacggc gacagacttc 24120 
"ctcgcgtatfg "ggccgaccat acggccaacg "cca'gaggtaa agcgctgtcc atggtgagtt 2 4180 
ccctgaacag aagggctggc gggacctcct ttccaagacc gtgctgcagg agtccgtcag 24240 
agcgcaggta atcccgtgct gtccgcgacc cagggctgtc ctccgtctgg ccgagggtcc 24300 
tcgtcttctg ggcgacatcc ctttagcgtg ggcggtagcc gccgaaggga ggcgccatgt 24360 
cggacgaatt gacgggcccg ttgggaacgg caatgcggga ggtcacgttt ccggaccggt 24 420 
ctcgcgggat catcttggtg cgggctggaa caccgcaggc cgaggccgag gcaatggccg 24 480 
cccgtatgtg ggccgagatg ccggaaggct gacgtgcccg aacgcagaca acccgtaccg 2454 0 
tcctcacacg cattcccctg agccgtcggc catggaacgg aaccagccgt acgaaccccg 24 600 
gaggcgccgt tgcggtctct gcggcgaggc cggggccacg cagggcgaag aggccgcgcc 24 660 
gcgttctgcc gcctggcgcg gctgccggct gttcacgaga acaccgaggg aggagtcgcc 24720 
cgcctcttgc ccggcgcgtt gccgggtgga gagcaggtgg tgaaggactg gctcgctgaa 2 4 780 
ggcggccgag gcgacctcgt cggccggcct gaacggcttt cactgtccca gcggcggcag 24 84 0 
gccgccgaca caggcatgct ttgccatctc cctcgctgtc tactgacccc agcagcagga 24 900 
tccagtacgg cgtcgcggcg ctgccgcctc actcgcgcat cgatcgggga atgcggcatg 24 960 
tggtgagggc ccggccggcg tgccggtcgg gccctcacac tgttttggtg tcggcgcgtt 25020 
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tgtcgtgtcg gtcagacgga caggtggggg gcgccgagca tggcggaagc ccgctgcaac 25080 
ggactgtcgc tgcgcgcggg ggcggcctcg ggaagggtgc ggcaggtgaa gcccagctgg 2514 0 
gccatggccc ttaggatttc gccggtgctg aagtcgcggc ggtcctggcg ggtgatgacc 25200 
tggccgacct gcttggcggg gtagtggcgt cgtccgatga tcacggactc gccggtgacc 25260 
ggttcgggtt tgacgccctt catcgattcc agcacgccgc tcttggtcag gtcgaacggg 25320 
aagcgggcaa tgacacagcg catgatgcct cacaggcagg agagttacgg ggccggccgc 25380 
cgtctggcgg ttcagcggga gagagcgagg acgcccaggg cgctgccgtg ttcgtcgacc 25440 
acgggcacca gcccgagccg tccgaagggc accgcgtcct cggcttcctc cctcgtggcc 25500 
gacggtgaga cgaagggctc gctgtcgtcg gtgatgtcac cgaggcggag ccggtcggtg 25560 
tatcgggagc tgtcccggac ggcggtgagc cgggcctggg tgaccaggcc gacgcaccgg 25620 
gcatcctcgt cgcagacgac cagatgctcg gcacgggcgg cggccatcac ggacagcgcc 25680 
acctcgacgg tcatgtcgta ccagacctgt ggcccggcgg cgtccatgac gtcggccacc 257 4 0 
gtgccgcgca atgggagagc gcctacggag cgatcctgca actgtcctgg cgtcaagggg 25800 
tgcctcctgc gcagacgggc ggggttcctg atcaggacgg tcctaggcgg ccgcgccagc 25860 
cgtggacttg agtgcggggg tacgccgcgt cgccgaggcg gggcggcgtc ggccgcgtga 25920 
ggtggcgccg cgcttcttgg ggcgttcagt cgccggggcg gtgatgacga ccgggatgcc 25980 
ggtcggggcc tgggctccgg tgatccggct gagggcctcg tcgcccgggc tgacctgggt 2604 0 
ggtctgcggc cggatcccgg cttccgacat gagacggacc atgccgcggc gctggttcgg 2 6100 
ggtgacgagc gtgacgacgc tgccggactc gccggcgcgg gccgtgcggc cgccccggtg 26160 
gaggtagtcc ttgtggtcgg tcggcgggtc gacgttgacg acgaggtcga ggttgtcgac 2 6220 
gtggattccg cgtgccgcga cgttggtcgc caccagcacg gtgacgtgcc cggtcttgaa 26280 
ctgcgccaga gtgcgggtgc gctgcggctg ggacttgccg ccgtgcaggg cggcggcccg 26340 
taccccgctg ttgagcaggt cccgggtcag tctgtcgacg gcgtgcttgg tgtcgaggaa 26400 
catgatcacg cggccgtcgc gtgcggcgat ctcggtggtg gccgcgtgct tgtcggcgcc 264 60 
gtggacatgg agtacgtggt gctccatcgt ggtgacggcg ccggccgagg ggtcgacgga 26520 
gtgcacgacg gggtcgctga ggtagcggcg tacgagcagg tcgacgttgc ggtcgagggt 26580 
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ggcggagaac agcatgcgct ggccttcggg acgcacctgg tcgagcagtg cggtgacctg 26640 
cggcatgaag cccatatcgg ccatctggtc ggcctcgtcg aggacggtga cggagacctg 26700 
gttcaaccgg cagtcgccgc ggtcgatgag gtccttgaga cgtcccggag tggcgacgac 26760 
gacctcggcg ccaccacgca gcgccgacgc ctgcctgccg atcgacatcc cgcccaccac 26820 
cgtggccagc cgcagcttca cagagcgggc gtacggggtg agcgcgtcgg tgacctgctg 26880 
cgccagctca cgtgtcggta cgaggaccag ccccagcggc tgccgaggct cggcccgccg 26940 
gccggccgta cgggccagca gagccaggcc gaaggcgagg gtctttccgg aaccggtgcg 27000 
cccgcggccc atgatgtcgc ggccggcgag ggagttcggc agggtcgcgg cctggatcgg 27060 
gaacggcacg gtcacccctt gttggccgag cgcggccagc agttccccgg gcatgtcgag 27120 
atcggcgaag ccctccgcag cgggaagcgc gggggtgatc gtccggggga gggcgaactc 27180 
cccctgaacg gcgccgggcc ggcggccgta accgccggag cggctgggtc cggccggccg 2724 0 
gcgcggcgcc ggcgaaccga agcggctgcc -gccctttccg gagtcggcac cgccatgacg 27300 > 
ggtgcgagcg aagcggtcgt tcgtgcgtgt gcggttcata cggaaccttc ctcgatgcgg 27360 
cacatatcaa ggaatttccg aagcaatgag cagcacggag aatcgcaaga atggaccggt 27420 
gggccttgcc agcggatctg gccgacagaa aatctgtgcg gcacgtgcgc tggaatgatt 27480 
gggggtgctg tgggctcgat attcgaagcg tccactgcac tgtagctatg aaggatgcgg 27540 
ctgcaccttc gaaggacgat ccgtgtgcgg taaacacacg ctgtccggag cgtcgtccgc 27 600 
aggtgaaatc actgcgggaa acgcatgtag ctggggcccg caccccgaag gatgcgggcc 27660 
ccagctacaa gtacgtgaca gtcggcgtca ggcgggaacg atgttctcgg ccgtcgggcc 27720 
cttctggccc tgcgcgatgt cgaagttcac cttctggcct tcgagcagct cgcggaagcc 27780 
ctgggcggcg atgHtcgagt agtgggcgaa cacatcagcg ccgccaccgt cctgctcgat 27840 
gaagccgaag cccttttccg cgttgaacca cttcacggta ccagcagcca tgtcatttct 27900 
ccttcggggc agtcgtacgg gatccgcacc gcgcggacct cgtgtcgccg caatgatcac 27960 
cccgcccgga aaaagaccgg agatgtaaaa gtgcttccag gggtactgag cccgaccgga 28020 
gcacttgaaa tttcgggaac cacaactgca actgacatcg acagtagcac gccacagcag 28080 
ccactgtgcg gtgaagaacg ccaccttgct tattgcggca gagaatctat ccgcatgctc 28140 
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cgatgaaaac 
gccgcgctcg 
cgcgggggca 
ccggtcgggt 
gtcttgcctg 
gttctgcgcg 
cctcaggagg 
ttgccgccgt 
tccgtcaccc 
ttcgcgcggt 
tccgcgatcg 
ccgaagtggg 
ccctcgtact 
gccaggtcgt 
acttccacat 
atcacgtgcg 
cagctcttct 
acgagaatgt 
gcttcgtcga 
tcggcggtct 
gttcggccgg 
cccgccccgc 
ccggcctgct 
atcgttcacg 
tcccggccat 
gggtgacgcc 



tcaaaccgcg 
cgtgatccgg 
gacaggtccg 
gtacgtctcg 
gcaaccggct 
cccgcttccg 
cgtgacccgg 
aggtgacgtc 
cgtatgccct 
ctgggtcggc 
cgaacgtcgg 
agagcatctg 
ccgccagctc 
agtagatgtt 
gcagcggcgc 
agcgcgcgcc 
tcccgaacac 
cgagccgctc 
gatggtcggt 
cctcgatggt 
tcacatacac 
gggtagcgcc 
cggtgttctc 
ctaaaaccga 
agcgccaact 
ggtcaggtcg 



cgcacagata 
tcccccacca 
gccatgacgc 
cgcgcggcga 
gtcggctcgg 
tgaacgcgcc 
ggaagccgag 
ctcgaagtat 
ggccagttcc 
ggccagcgcc 
ctcttgggcg 
ctccgaacgc 
ctgagccagc 
ctcgcggtag 
gtcggagcgg 
cagctccagg 
cgaggtggcc 
gtactcccgg 
gggaactgcg 
ctcggtcgtc 
ggtagcgccg' 
cgccacgagg 
agtggtctgc 
cagaacacgt 
acgtcctcgc 
gggtaggggc 
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ttgaccttcg 
cgctctccgc 
cggccatgct 
gcactgcggg 
gctggttggt 
gcagccccgg 
gatcagcggt 
gcccaggcat 
ccgctggagg 
gcgaccgtac 
atcgcgtcac 
aggaagcccg 
ccgaacgcga 
cggcggttgg 
atcagcagcg 
atccgcaggc 
agaaggtgct 
tcgatccgct 
attceggtgc 
cggccgacct 
gcccgcccca 
gcgatccgtc 
ctggtgatgt 
caccttttat 
actgagcgtt 
cgcaacgcac 



cgcgacgcca 
tactgcacgg 
cggggcgtag 
ggaggggccg 
cagccgtggc 
ctgccttcta 
agtcgtcagg 
ccggccggct 
tggacttgcc 
gggccaggta 
gccagttctc 
gggacaccgc 
ggcggatcgg 
agtgcgcggt 
gaagcagcag 
cgtcggcgag 
cgccgcccca 
cgacgagggc 
cgcccgctgc 
cgctggcccg 
gttccacagc 
ctgccagcgg 
cctcgttgct 
gtggggggta 
ttcagcgtgg 
aaggctcgcg 



tatatcgcat 
gtcgcaccgc 
cggacgcctg 
gttgccagac 
aggtgatgtg 
ccaggccgac 
ggaggcttcc 
gccgtccacg 
gttccaccgc 
gtgcggggac 
ctcactcaca 
gaccgccgtg 
ggcgttcttc 
accgtcggtg 
cgccgccgtg 
cggtgtctcc 
caggtcgttg 
gcggacctgg 
ggtgacgagt 
ggcccgggtg 
ctgagctcgt 
acccttcgga 
catgtcatcc 
ccgcgcatca 
gccaccgatc 
tgcacgacat 



28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 



WO 99/37660 PCT/US99/01313 

-106- 

ggccaccgcg cgcatgatct cccagcggga gcccagccgt ccccggcagc cccagccgct 29760 
gagaccagct cacccgggac acccggtccg acaccgcaca cgatcaagta gtcgacctcc 29820 
agacgcgttc agcagcccac atcccaggag ccgtctaccg tcccaggaac ccctgctccg 2 9880 
ggaccatcgg gctcggcacc gggagtgcac agttgatcag taactggcaa cgagctcgtg 29940 
cacggtaagc ggtgaggtgt cgaggtccag atgggcggcg gcggtggtgc ccccagcggt 30000 
cggccgaccg gcatgccgag cgggcagccc accggtgtgc cgagcggcgg acccggcggc 30060 
ggcacgggca tgggcggcac ccccaccccg cagcacctga agtcggtcag gaccggccgc 30120 
gtgacgggct tcgggtcaga cctgtgcggg gaacagcagg cagtcgtccg ggcggatgat 30180 
caggttgatc tcgccgtccg tgtgccggac ggggctctcg gcatggacgc gcacgtcgcc 30240 
gatgctgagc tcgtactcga atcgcgctcc ggtgtacgag cactgctcga tcctcgcccg 30300 
gagcacgttg acggcaccgt cgtgcggggc gtcggcgcgg tcggtgagcg tgatgcgttc 30360 
cgagcgcagg cccacggtgg cggacgaccc-cgcggagcag gcgccggcca ccctcaagcg 304 20 
ctgaccggtc tcacccagtt cgacctgtac ggctccgccc tcggtggcgc cgacgcgccc 304 80 
ctccaggagg ttgcagcggc cgatgaagcc ggcgacctcg ggagtggcgg gagtctcgta 3054 0 
gatctcggtc ggtgtgccca cctgctggag gtgtccgtgc atgaacacgg cgatgcggtc 30600 
ggacagggac atggcctcga cctggtcgtg ggtgacgtac acggtggtga tgccgacctc 30660 
ccgctggagg tccttgagcc agacgcgggc ctggtcgcgc atcttcgcgt ccaggttgga 30720 
gagcggttcg tccaggagca gcacgccggg ggagtagacg atgcctcggg cgagggcgac 30780 
gcgctgctgc tgtccgccgg agagctggtg ggggtagcgg tcgcgcaggt gagccatgtc 3084 0 
gaccttggtg aggacgtcgt cgatgaggcg ccgttgctcg cccttggtga ccttgcggag 30900 
cttcagcggc agtgcgaggt tgtcggcgac ggtcatgtgt ggccagagcg cgtacgactg 30960 
gaagaccagg ccgagattgc ggccttcggg gggcaccgtg ctgcgccggg tgccgtcgaa 31020 
gaagacctgg tcgccgacac ggatggtgcc cgagtcgggg gtctccagac ccgcgacgca 31080 
cgacaaggtg gtggacttgc cgcagcccga cgggccgagc agagtgaaga actccccgtc 31140 
cgcgacggtg aagttgacgt cctccaggac cgcggtcccg tggaaggact tcttgatgtt 31200 
ctcgacgacc agctcaggca tgcttcttcc ccttcaggag gagaccggcg aggccggcga 31260 
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cgacggcggt 
gggtccacag 
tggcgggggc 
cgggcacgag 
cgcgtgcggc 
acgccgacgg 
ccgcgtcgcc 
ccgggaccgc 
agcggcgtgc 
aggccacggt 
ggaagttgtc 
acgcgcgcag 
cgaaggcgaa 
tgccgcgcac 
gggcgatggt 
tgatcgagtc 
cgcccaggat 
cgtagagcat 
ccgcgccgac 
cgcagaacag 
gtgagtacag 
cggccgggcc 
gccagcgcag 
tcgccatcgg 
tgtcgaggac 
tgaaggcgcc 



gacggcgatc 
atcgatggcg 
gtactcgcgg 
gagacggagc 
gttgtcgagt 
gagtgcccgg 
gccgggtatc 
gtgcggtgac 
cacgaggacg 
gatgctgttg 
cagggtgagc 
gatgagcgcg 
gcccagccac 
cgacacgaac 
ggtgagcagc 
gatgccctgc 
gagcgggatg 
cggcgggcgc 
gctgcgggcc 
gtaggcgatg 
gttccagggc 
gtagacgacg 
cacctgcccg 
caccgcgatg 
gatcgaaccg 
gccggccgcg 



tggagggtgg 
gtggtgccga 
atcatctggg 
atgatccggg 
tcggctccga 
atgccgaagg 
caggtgaagg 
tgcgctgtcg 
accaccgtgc 
acgatcgact 
aggtcgaacg 
agcatcggca 
ttccaggcac 
cgctgggcgt 
atgaaggtgg 
tcgtagagga 
gacagggtct 
agcatcggca 
gcgttctcca 
gggacctggg 
acccagccga 
atccacccga 
aacaggcggg 
gcgagcgcga 
aagcccgccg 
tacaggggct 
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cgagggcggc 
tgacctgtga 
tccagatgag 
acaccgtgcg 
gctgcatggt 
cgatgatcag 
cccagaacag 
tctccaggag 
cgaacagggt 
cggtgtaggg 
ggttcaccag 
gcagtgcgcc 
cgatgtcgag 
gccccagcag 
acgccgcgcc 
agttggagaa 
cgatcgccgt 
ccacgaccga 
gagaggtgtc 
cgacggcctc 
agccctcgcg 
aggccaggac 
cggcggggaa 
acacggtcgt 
acgtgaacag 
ggttgcggac 



cacggagccg 
ctcggctccg 
caggaacgag 
ccaccagtcg 
cgccggggag 
cgcgaagagc 
gccgatgccg 
acgggcgaag 
cacggccacc 
ggcgtagtcg 
cggagtgagc 
gaagacgacg 
caggtcggag 
ccgcgtctgg 
cagcaggccg 
gagggtgatg 
gccgaagatc 
gcgcaggacg 
ggaggcggcc 
gacgaacgcc 
caccgcgccg 
gagcggggag 
gcgggtgcgo 
caggacggcg 
gtgggtgtag 
cgactggtag 



gtctcaccct 
gcgaggaaca 
gcgagcatcg 
gcgccggcga 
atcgcgccgt 
gtgccgcgca 
acgatcaggc 
cggaagtcgg 
gcccccacga 
aagatcagac 
agcgaggtga 
tacagaccga 
cgggtcgcct 
aacacgacca 
tagtccggat 
ccggcgggct 
agcagacccg 
cgcagaggcc 
agcgcgttgg 
ataccgggca 
gtcaggaagc 
atgtagatgg 
tccagcagaa 
aagaggaggg 
ttcgagaggg 
aggatcggta 



31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
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cgacgggggc 
cacgtccggc 
tgcgggacgg 
ggggtgggga 
gcgcgacgta 
ggtccgcgcc 
ccggccggta 
ggaagtccag 
cgcgcggcat 
gctcggcggc 
ggccagttcc 
gggccggacg 
gaaggtgaat 
ctccaggggc 
ggctatcacc 
gccggcgccg 
cgccgcggct 
atggacgatg 
cgccgatcag 
acagcgacga 
gaccgtcgac 
gccgacgcgc 
cgacctgccc 
cggccggtgt 
tggcacccgc 
ggccgtttcg 



gaggacgagc 
gccgaacagg 
cgcgggtggc 
caggggcgtt 
cgcctcccgc 
gagcctgcgc 
cggcaccagg 
gaagagacgg 
gacgacggtg 
gggtatcgcg 
tggcgcgagt 
agggaeaggg 
tcctcctctc 
agtacaccca 
gcccgcactc 
tagttgcggc 
tcgtggacgg 
cggccgggtc 
gacgacgccg 
ccccggtgac 
cgcgccgcct 
cgccgggcac 
acctggtgcc 
acgccccgaa 
tccggcggcg 
agtggaccgc 



acggcggtga 
cgccggtact 
gccgggggtg 
gggcgggcgg 
acgcgctccg 
cgcatgtcct 
ccgccctcgg 
gccgcgttcg 
ccctccgcgt 
acgccatcac 
ggaggcgcag 
cgacggggtt 
cggtgaaggc 
tgccgatcag 
ccagcagcgc 
ccgcgaccac 
gccccatccg 
ccttcgaggg 
tgcaacggac 
ttccgatgac 
gctctgggtc 
cgtcgtccac 
acatgacagg 
gctcaccgcc 
gcaggagcag 
ggacgcggaa 
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cgaggaacgt 
ggggcgtgcc 
tctggatggc 
gggcggctca 
gcacccgcac 
gcagactgtc 
cgaccgccgc 
ggtgcggggc 
agtagctcca 
gtacggcagg 
cctgagacgg 
cgtgccgacg 
gtgggccgat 
gttgttgcgg 
ctgtgccttg 
gacgagatcg 
cagttgcgtg 
cctggccgcc 
gcgaggacag 
gcgcacgccc 
accctcccgc 
gtcgtcgcca 
tcacacctgt 
gacgcgcgcg 
ccccggaacc 
cgtcacggcg 



cagccagtgg 

cagctcgccc 

cat gacgact 

gccggccgcc 

gggccggtac 

catggcgtcc 

ctgcccttcg 

ggtcttcacg 

ccccagcagt 

atcccgccgt 

accgtggcgc 

cacaggtcgg 

gcggtgtcgc 

tggatgcgct 

gcggcccagt 

tggcccgcgg 

ccccgatggc 

gggtcgttcc 

cgtcagcttc 

ccgccgcccg 

tccgcctgcg 

ccgacccccg 

ctcggcacgc 

ccacccgccc 

ggtgacgcat 

tccggaaacc 



atggtgacct 

gcgcgcggga 

ccgtacgaac 

ttctcccagc 

agatggacgc 

tggcgcacgt 

gcggagagca 

acggacagcg 

cccccgctgt 

gccgcaggta 

ggggtgcgtc 

cgaggccgtc 

cctcctcgaa 

cgaaggactc 

cgcggctgga 

cgcggtaggt 

cgccctgcgc 

tgacccgagg 

gtccgcggca 

aacccgagct 

agatcagatc 

cggcaccgct 

ccccggcgaa 

ggacgcaccc 

ctcgtcggcc 

ccggaaggtg 



32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
-3354 0 
33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
34080 
34140 
34200 
34260 
34320 
34380 
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accggcctgc gtgtcttgaa gccgagccgt tcgtacaagg cgatcgcgcc ggtgttcgcc 344 4 0 

tcggccacgt gcaggaaggg acgatcaccg cgcgccgaga tgcgctcggt gagagcgcgg 34500 

acgaggcggg cggcataacc ctgcccgcgc gcctcgggag cggcgcagac ggcgctgatc 34 560 

tcggtccagc ccggaggacg caggcgttcc ccggccatcg ccaccagggt gccgtcgacc 34 620 

cggacaccca ggtaggtgcc gagttcatgg gtacggggcc agaacggccc cggctcggtc 34 680 

cgcgcggcga gatccagcat ctcaggcacg ctgtccgcgc ccagctcgac cacgtcggtg 3474 0 

tcggacgcgg agcgagttcg gccggggcgg ccgtcgccgg gccaggtcat ctgacggccc 34 800 

tcaagactga aaaccggctc ccaacccggc ggcggaacgg ccggggagct gaacatgtcg 34860 

gcgaaggcgc cgggaccgag taggccggce aggtcggccc agtcctccgc gtccgggtcg 34 920 

acggacacgg aggagaaggt cgccacgtcg gtgagatagg tggctgctcg accgaaccgt 34 980 

cgggcgagat gagcgtgccg accactgagc gactgaccta ccgggtcgtc gagtgcgggg 3504 0 
tcgtcgtcgt tcatcatcgt gccgtttcct tcctggtgag cgcggtggtb "gaagggtggc 35100 

cgcggtaggc gaaaagtcgg cggcggggcc cgtggcccga tagtcgtagc ccttgtcacc 35160 

gtgcagtttg ccgggtcgcc tgaggacttc cggctggagg ccaatgccaa agcgctctcg 35220 

tgccggcgga ggcacgcctt ctgacgtgcc tccaccggca ccactcagtt caggcagatt 35280 

gagcttgagc gatgcagcgc cgccggaagt cgagcgcctc aatgcatcga gcggcgactt 3534 0 

cctcgttctg ggtgagagca gtcctgcctt gtcgagtgat gcggcgttcg gaccgtcacc 35400 

ccggcgaagg ccaggacctg tcccacggag tggctcatcc acctccccct cctcggccca 354 60 

cagcttcagg cccgacgtag ggggaggggc gactcggaac ccggcgtccc gctcgcgaag 35520 

gtcggtcaga cctgttcgaa gtggaacgcc ttgatgaagc agtcccgggg ttgggcgacg 35580 

gcgaagagga tgcactccac gagggactgc gcggtgaggg cgtcctcggc ttcgcgtgaa 3564 0 

gtggtcgccc actcctcgga gagcgggtcg gcgttgtcga agtcgggcgg gtagagcgag 35700 

atcacccgga ctccttgggc gcgcaggcgc ttggagagga tttcggtgaa ccctgcctgg 35760 

gcgctcttgg ccgcgtagaa ggcgtcgtgt gcgtccgagc ggtggtggcc cggtgttccg 35820 

caggcggaga ccatcgtcac gacgtcgggt gtgtccgagt tgagcaggag ggggaggaaa 35880 

ctcctcgtgg tcaggaccgt gccggtggct ccggaggcga tggtgtccac gacgtcggcg 3594 0 
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tcggttgccg acagcaggtc cggcccggtg aggtagcggg agccgttgtt gacgagtacg 36000 
tcgacgcggt cggtgtgttc cgcgacgccg gaggcgaagt cgcggatcga ggcaggatcc 36060 
gtcaggtcgc aggcgaaggc gtgcacccgc tggtgtccgc ggtcgcggat ctcgtcgcgg 36120 
acccgttggg cggcggcgag ccggcgtgcc gagaggaaga cctccgcgcc gaggtccgcg 36180 
aggcggatgg ccagggttcg tccgaagtcc cggccggcgg ccgtgatgac gacgcggtgg 3624 0 
ttgtcccatc tcatggtgtc gttccccagt cgccgtttcg tggatcgggt ggtgccgtgc 36300 
accgcgtctc tacgctatcg gtcatggtcg ctcacgaacg gtcgttcacg gtcaatgatg 36360 
atgttgaggt gcccaacccc ggtgcggacg aggtctggac cgtcggcgcg gtcatcctca 36420 
atcgggaagg tcgtgccttt gcccagaagc ggagccggga ccgtcgcctg ttccccgggg 364 80 
cctgggacat cgtgggcggt catgtcgagg agggcgagac gcttctggag gccctcgcgc 36540 
gtgaagtcga ggaggagacc ggctggcgcc tgacccgtgt gcggcggttc ctcggcacca 36600 

cgacctggac-gggggacgac - ggcggcggcc tgcgtcacga ggccgactac ctggtcgagg 366 60 

tggacggcga cctggaccac ccgaggctgg aatggtccaa gcactccgcc tacgactggt 36720 
tcggccccgg cgatctcacc cgcctcaagg agaaccgcgg accaggggag tacctgatcc 36780 
acgacctcat agccggtgcc gttgccgact cgcctttcga cttgctccgg gcggacgccc 36840 
tcaccagccc ggaccggctg cgcgagctct acccgcagcc gaacccgaac tcgctgcgca 36900 
aggagaccga ccgcctgacc gaggagaccc gggcgctgat cggctgttcg tcactggtgt 36960 
tcatcggcag cgcggaccgc gagggccggg cggacgtgac gccacgtggc ggcccggccg 37020 
ggttcgtctc ggtgctggac gagcagaccc tggtgatccc cgacgcgacc ggcaacaaac 3708 0 
ggctcgacac cctgcacaac gtgctggaga ccggacgcct ggggctgctc ttcctcgtcc 3714 0 
ccggccgccc gaccacgctg cggatcaacg gacgcgcctg tgtttcggcc cgcccggagc 37200 
tgctcgcccg cctcactccc gtcggaaagc cgccggtcac cgcgctggtg gtgcaggtcg 372 60 
agcaggtgta tccgcactgc ccgaagtcac tgatgcgcgc cgacgcctgg cgacccgagc 37320 
agtggatgcc cgccgacgcc cagccgagca gcgccgaggt gacccttgcg cagctgaacc 37380 
tgcccggcct gaccctggac cggatcgagg atgccgaacg ggagtcgctg cgcctgcggt 374 4 0 
acgaatgacg acgagtcgat gagcgccgat gagccgatga gacccgacgg gatccgacgg 37500 
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gtcggcgtcc gcggcgagca gaccggtcgc gaaggtcacc gcccgcacgg cggcgaccct 37560 
cgcgacggtc agtactgtcc ggtcaggtgc gggtccagcg ttggttgctg ccgttggagc 37620 
aggtgtacag ctggatcagg gtgccgttgg ccgtgccgtt cccgacggcg tcgaggcaga 37680 
ggccggactg gacgccgacg acggacccgt cggagttgag gcgccacttc tggttgtcgc 37740 
cgccccagca gctgtagatc tggaccttgg agccgttgcc ggtgcctgcg gcgtccaggc 37800 
acttgtcgcc gtagaccctg agctcgcccg cgtcagtggc ggcccactgc tggttggtgc 37860 
cgctgtggca gtcccacagc tggagctggg tgccgtcgga ggtgctggcg tcgggcacgt 37 920 
cgaggcagcg gcccgaaccg acgcccttga tctgtccccc gtccgcgggg ggctccgagg 37 980 
agtcgccgcc gttgagtgcg tcgaggacgg cggtgtacgc ggccttcttg ctgccgtcgt 38040 
tgttgaacag caacggcgtc tgctccgacc gccaggagtc gctgtcgcgc acaccccaga 38100 
cggtgacgcc gaggcagcgc gagacggcca ggcagtcgtt ggtcacgttg gcgtaggtcg 38160 
aggccggggc gccctggatg tccagctcgg tgatggccac gtcgacgccg agggcggcga 38220 
agttctgcag tgtggtgcgg aagttgctgt tgtaggggct gccgctgttg aagtgcgact 38280 
ggaagccgac gcagtcgatc ggcacgccgc gctgcttgaa gtcccgcacc atgttgtaca 38340 
tggcctgggt cttggcccag gtccagttct cgacgttgta gtcgttgtag cagagcttgg 38400 
cggacgggtc ggcggcgcgc gcggtgcgga aggcgacctc gatccagtcg ttgccgctgc 38460 
gttgcaggtt ggagtcccgc cgcgctcccg aactgccgtc ggcgaaggcc tcgttcacga 38520 
cgtcccactg gacgatcttg cccttgtagt gggccatcac gccgttgatg tggtcgatca 38580 
tcgcctggcg cagcgcgctg ccgctgaggc tctgcatcca gccgggctgc tgggagtgcc 3864 0 
aggccagggt gtggccgcgc acctgcttgc cgttctgcac cgcccagttg tagacgcggt 38700 
cggcggagct gaagttgaac tggccccgct gcgg 38734 



<210> 31 
<211> 3331 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism : Unknown 
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<400> 31 

tcggatctcc ccacacaaca tagatagagg atatccgcct gggttcacaa tgaagttact 60 
ggtggttctc accaccctcg tgggctttag ctcagcacta agtttcggtt gtaattacag 120 
accagtatta ggcttcaatt cacagtatat gctgggagga ctaagacttt tctgtatgcc 180 
tgccatggtt tatgatccat gggcatgtgg ttgcgtttcg gcatggagca gtgcaggtct 240 
ttacggtgtc ggagggggcg gaggcgcctg gggagctggc ggtgctggag gagccgacgg 300 
cggacgcggc ggcggcggtg gagattggga atatgactat gatgacgaca gcgatgacga 360 
tgatgaatgg gactgggatg atgacggtgg aatgggagct ggcgccggag gtggtgctgg 420 
tggtggtgcc ggaggtggtg ctggtgctgg tgctggagca ggcgcaggag caggagcagg 4 80 
tgctggactc ggacttggat tgggcggagg tctcggaggt ggacttggcg gacttggagg 54 0 
tcttggcgga cttggcggtg gagacgattt atttgattta gatttcgatg atcttggtgc 600 
agctcttgcc ctcggtggag ctggtggagc tggaggtgct gctgctgctg ctgcagctgc 660 

- -cgctgctgcc-geegggggtg-gagttggtgg agctgctgcc -gcagccgeag ccgctgctgc 720 — 

cgctgcagga ggaggcgcag gtagacttgg aggagctgct gctgcagccg cagccgctgc 780 
tgccgctgca ggaggcgcag gtggacttgg aggactcggt ggcggacttg gaggactcgg 840 
tggcggactt ggaggcctcg gaggtcttgg tggcctcgga ggatatggag gatctgctgc 900 
tgccgctgct gctgctgccg ccgctgctgc cggaggtgga ggactcggtg gtgttggttt 960 
ctacggtgga cgaggaggta gacgcggtcg aggaagagga ggccgcagac gtgctgctgc 1020 
tgccgctgct gcagctgccg ccgcagccgc tggtggtggc ggaggaggtg gaggtggtgg 1080 
aggaggaggc ggaggcgctg gtgctgccgc tgccgctgca gccgctgctg catctgcttc 1140 
agcttctaga caaatgagtg gtataaggga cgcattagga gacattaaag accttctcag 1200 
gagtaatgga gcctctgcaa aagcctctgc taaagcatca gcagtagcaa gcacaaaatc 1260 
tcaaattgac gatttgaagg atgtcttaaa ggatcttgca ggtctattga aaagctcagc 1320 
atctgcttca gcatctgcat ctgcatcagc ttcagctgga ggtggaggcg gtggtggtaa 1380 
cggaggtggt aacggaggag gaggcggcgg tggagctgga gctctagctg ctgctctcgc 1440 
tgctgcagga gccggaggtg gacttggagg tggaggcgga ggcggagctt tagccgctgc 1500 
actagctgct gctggtgcag gtggaggagg ttttggtgga cttggaggac taggcggtct 1560 
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tg g t ggggga tctgccgcag ctgctgcagc cgctgccgct gctgcatcag gtggtggagg 1620 
aagagcactt agaagggctt tgagaagaca aatgcgtgga ggtggatccg ctgctgccgc 1680 
tgctgctgct gctgcagctg ctgctggagg tggatgggga ggtggaatgg gtggaggatt 1740 
cggagtaggt ctcggtggag gattcggagg aggatttggt ggtggatcat cagcagcagc 1800 
tgctgccgct gctgcagccg ccgctggatt tggtggaggt ggacgaagag gtagaggtag 1860 
aggacgtgga ggcgatggcg acggtaacgg agctagtgct gtagctgcag ccgccgccgc 1920 
tgctgctgct gctggaggat ctgctgctga tgttgccgct gccgctgctg cagccgcagc 1980 
tatgtacggt gacggtgctg atggacctga tttcgataat ggattcggtg gtggaaacgg 204 0 
aaatggaggt ggcggatctg gtggtggcgg atccggcgga ggtggatccg gtggcggatc 2100 
tggaggtggc ggtggatctg gtggatcagg cggtggcggc ggatctggtg gttcaggcgg 2160 
tggcggatca ggcggcggtg gaaacaatgg atggggaaat aacggcaaca ataaatatga 2220 
cgatgatgac tgtgatgaat atggtaaccc tattagaagg gggtaaatta tttgacatta 2280 
tccgccattt gactcatttt tcttagttct ctatgtttta tacttcacct tagattgttt 234 0 
tagtttgatt gaataaatta tgttttcgat ataaattttt tttaaattaa attaaacttt 2400 
attagttgac ctgtaaactt tttcatggag ttataatcta aggaacaaaa aacatacata 24 60 
atatgttcag tattgtggta aagcacctgt accgcaaaca caatcacctc tatacatgta 2520 
tacaaaatca gtaatgctga caaaatcttc tacactctca cctacacact cgcacacagt 2580 
cctcttacat acacagcact ataatatcct gaacatgaag tttgtgttga taaaaagttc 2640 
agaaaaatct cccctacatc acctgatctt tcactgaaaa tttacgacaa gtattgaaaa 2700 
tagcagaaag aaaacgggaa attgagaagt tttctataaa aaacaatcgg aacaatgact 27 60 
ggaatgacaa ggatgaaaat aatgataact tacattaatt aaggccccaa taatctctct 2820 
attttcaaac ttttttttca aatgttctct ctaactcact tgcatctatg tggaaattca 2880 
catactatac taaattacca caagtatcaa ggtttcacaa cctctcatgc cttcatggca 294 0 
gaccatgctg ggtatttgtc taacaatgcc tcataaatac ataaaactaa ctaacaaaat 3000 
aggtcagtct gtaacaaatt attaatgcac cattattgca ttttctaaaa caaagcatac 3060 
actggatatt ggcagacaaa atgttgttat tggatacctt tccattctat ctagacactt 3120 
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gctttccaca agtcatcata aataaatccc ccctatccca aatgtcaatg gaatgcccca 3180 
acccttcccc cataatttta aaacctagaa taaattaaaa catctatagt tcgtcatgat 3240 
catctttett atcatcctct tcttcttcct cctcctcctt cttcttcttc ctcctcctca 3300 

3331 

ggttcttggc tgcctgctcc ttccttgcca a 

<210> 32 

<211> 5224 

<212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 32 

ggatcccctg ctcgacgccg gcggcccggt acacctccat cgtgcggacg ttgttcccgc 60 
gcccgcgagg gtggacggag gtgccggcgt gacgctccac cagcatgtgc cgcaccccga 120 
gccggcccag gaacacggac gtcgacaggd-ccacgagcga-tccgccgacg acgaggaccg 180 
gaaccctgtg gaccgtgtcc ccggcccgat cggctcttgc gttcatcttt ctcctccagc 24 0 
gcgtgatgtc cgcccactcg gccggtttcg gccggggtca tgcatgcccc gcgaggctgg 300 
agcgcggtgc gccggggacc acacttcacc cgcttaaccc gctgcgttcg cgcaggggca 360 
cggcacgccc gacgatcgtg ctcacgggcc gacgcaccgt catgtgacgc gtcggccgcc 4 20 
ttaccgttcc tccaggaaga ggtgcgcctc aatgacggtc tctgccgctg tgtccacggt 4 80 
cccggaccgt gtccccctca ccgtgttcga cggttcccgg gtgcgggtcg tgctgatgct 540 
ggacatccgc gacgggacgc aagcggaggt cctggacgcc tacgagcgga tgtccgaccg 600 
ggtcgccgcc gtgccggggc acatcagcga ccagctgtgc cagtcgctgg agaaccccac 660 
ccagtggctc atcaccagcg agtgggagag cgcaccggag ttcctcgcct gggccaacag 720 
cgaggaacac ctggagatgg tccgtcccct ggagccctac gtccgcggca cccactcgat 780 
gcgctactcg gtgctgcgcg agacggccga ggagcgggcc ggggcgggtg cggcggcccg 84 0 
gggcgcgctg cagccccggc cgcgcatcgg cgacaacgtg gtccggcacg ccgtcaccta 900 
caccgtcaag cccgacagcg tcaccgaggt cgtgaagatc ctctccgcct acacctcgcc 960 
cgaggtgcgo gtggacgaca ccacgcggct cgtgcgcacc tccctcttcc tgtacggcaa 1020 
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ccgggtcgtc cgggcgatcg aggtgcgggg cgacctgcag gccgccctgc gccacgtggc 1080 
ccggcagccg gaggtgcgcg ccgtcgagga agccctcacc ccgcacatcg aacaggaccg 1140 
ggacctcacc gacocgcggt ccgcccggct gttcttcacc cgggccgcgc tgccggccgt 1200 
ccaccacgtg gtgtccgggc gcgggacggg cggcgacacg cagcggtgcg cgctgtacta 1260 
cccggcccac cccggcgccg gaccggcgct cgcocggctg ctggcgcggc agggcgaggc 1320 
caccgtgggc gacccgggca gtccggtcgt cgcctgcacc gtcttccacc gcgacgacct 1380 
cgtcgtacgg ctcgtcgaca cggcgggcgc accggagcgc gcgcccgggg ccgtcctggc 1440 
cctgcacgag ccggacgccc tcgccgaggc cgggcggctg ctggacgccg ccgcgctcgg 1500 
cgccgacggc cccccggacg accgggcgct gccgacgttc ctcgcgcacg cccggatgcg 1560 
gcctctgaca gaccgtcagt cgccggcctc ctgacccccc gctcgcccga cctcagggag 1620 
tg_acc y acat_gacagaacag "ggcacgca- tcgtcgcctt cgacgacgtc ccgcccaacc 1680 
ggcggcgcgg cggcgacgtc cgggccctgc tcacgcccac gaccgcgggg gcgaccagcg 1740 
gcttcatggg cgtggccgtc gtacggcccg gagaacgcat ctccgagcac taccacccgt 1800 
actccgagga gttcgtgtac gtcaccgccg gcgccttcga ggtggacctg gacgacgtgc 1860 
cgcatcccct gcgcaccggg cagggcctgc tcatccccaa ggacgtgcgc caccgcttcc 1920 
gcaacaccgg cgacgtcgag gcgcgcctcg tcttccacct gggtccgctg gccccccggc 1980 
cggacctcgg gcacgtcgac accgaggaga ccgacgagac cgcgccggcc ggggtggtgt 2040 
catgagccgc cgggtcgtcg tcaccggcat aggcgtcgtc gccccgggcg gcatcggcgc 2100 
ggcccggttc tgggacctgc tggccggcgg gcgtacggcg acgcgccgga tctccctgtt 2160 
cgacccggcg cgcctgcgct cgcagatcgc cgccgagtgc gacttcgacc cgtccgcgca 2220 
cggcctggac gacgagacgg tccggcggtg cgacpggtac gtgcagttcg cgctggtcgc 2280 
caccgccgag gcggtccgcg acgcgggcct ggacaccacg cgcgaggacc cctggcgcat 234 0 
gggggccgtc ctcggcacgg cggtcggcgg caccacccgc ctggagcacg actacgtcct 24 00 
ggtcagcgag ggcggctcgc gctgggacgt ggaccaccgg cgggcogagc cgcacctgca 24 60 
ccgcgccttc gcccccagca cgctcgcctc caccgtcgcc gagaccttcg ' gcgcgcaggg 2520 
cccggtgcag accgtctcca ccggctgcac gtccgggctg gacgcggtgg ggtacgccta 2580 
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ccacgccatc gccgagggcc gtgccgacgt gtgcctggcg ggcgcctcgg actcgccgat 2640 
atcgccgatc accatggcgt gcttcgacgc catcaaggcg acctcgccca gcaacgacga 2700 
cccggagcac gcctcccgcc ccttcgacgc ccgccgcaac gggttcgtga tgggcgaggg 27 60 
cggcgcggtg ctcgtgctgg aggagctgga gcacgcccgg gcccgcggcg cggacgtcta 2820 
ctgcgagctc gccggctacg ccaccttcgg caacgcccac cacatgaccg ggctcacccg 2880 
ggagggcctg gagatggcgc gggccatcga caccgcgctg gacatggccc gcctggacgg 2940 
cacggacatc gactacgtca acgcgcacgg ctccggcacc cagcagaacg accggcacga 3000 
gaccgcggcg gtcaagcggt cgctgggcga gcacgcgtac cggaccccga tgagctcgat 3060 
caagtcgatg gtgggccact cgctcggcgc gatcggctcg atcgaggtcg tcgcctgcgt 3120 
cctcgccctg gcgcaccagg tggtgccgcc cacggccaac tacgagacac cggaccccga 3180 
gtgcgacctg gactacgtgc cgcgcgaggc acgcgagcgg gagctgcgca gcgtgctgtc 3240 
ggtgggcagc ggcttcggcg gcttccagtc cgcggtcgtg ctgaccggac cggagaggag 3300 
gctgagatga gcgcaccccg gcgagccgtc gtcaccggac tcggagtggt ggcaccccac 3360 
ggcatcggtg ccgagacgtt ctggaagacg gccgtggacg gcaccagcag cctggcccgg 34 20 
atcgaccggg agggctgcgg ccacctgccc ctgaagatcg ccggccaggt ccccgacttc 34 80 
gacccggccg ccctgatcga ggacacctac ctcgtccaga ccgaccgctt cacccacttc 3540 
gcgatggcgg ccacccagct cgccctcgac gacgcccggc tctcccgcgc cgacatcgac 3600 
tcgccgtact cggtgggcgt ggtgacggcc gcgggctccg gcggcggcga gttcggccag 3660 
cgcgagctgc agaaactgtg gggccagggc tcgaagtacg tcggccccta ccagtcgatc 3720 
gcctggttct acgcggcgag caccggccag atctccatcc gcggcggctt caagggcccc 37 80 
tgcggcgtgg tggccgccga cgaggccggc ggcctggacg ccctcgcgca cgccgcgctg 3840 
gcggtacggc gcggcaccgc caccgtcgtc gccggcgcga ccgaggcccc gctggccccg 3900 
tactcgatgg tctgccagct gggttacccg gagctcagcc gcagcgccga cccgggccgg 3960 
gcctaccgtc ccttcacctc cgccgcctgc gggttcgtgc ccgccgaggg cggggcgatg 4020 
ttcgtcctgg aggaggaggg cgcggcacgc gagcgcggcg ccgacgcgcg ggcgacggtg 4080 
gccggccacg cggccacgtt caccggcgcc tcccgctggg aggagtccag ggccggcctg 4140 
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gcgcacgcga tcggcacggc gctggcgcgg gccggctgcc gtccgcagga cgtggacgtc 4200 
gtgttcgccg acgccctcgg cgtgccggag gccgaccggg ccgaggccct ggccctggcc 4 2 60 
gacgcgctcg gcccgcacgc gcggcgggtc cccgtcaccg ccccgaaggc gggcatcggc 4320 
cgggcgttct gcgcggccgc ggtgctcgac gtggcgaccg cgctgctcgc catggagcac 4380 
gagctgatcc cgcccacccc ccatgtgctc gacgtctgcc acgacctgga cctggtggtc 4 4 40 
ggccgggcgc gtcccgcccg gocgcgcacc gcgctggtgc tcagccgcgg actcatgggc 4500 
aacaactcgg cgctcgtcct gcgcaggggc gccgcgccgt tccccgagta agtaccccga 4560 
acaggtgtct cacgtcccct tcgggcgcgg gcacccgagt caaggagctc aaccacatga 4 620 
ccgacatgac cgaacgcgtg ggcacccagg tgaccttcga ggaactgtcc gccctgatga 4 680 
agcgcaccgc gggcgtgcac gtggaaccgc ctgacctgcg ggcgcgggcc gaggagggct 4740 
tcgacggc-tt cggcctggac tccctgggcc tgctgggcat cgtggccgag ctggagaaga 4.800 
agcacggcgt gggactgccg gagcaggtgg agcgctgcaa gacgcccgcg gagttcctcg 4860 
cgcaggtgaa cgccaccctc aggacggcgg tgtgacatgg ccgggcacac cgagaacgag 4 920 
atcgtcatcg ccgcgccgct ggacctggtc tgggacatga ccaacgacgt cgagaactgg 4980 
ccgcggctgt tcagcgagta cgcctccgcc gagatcctgg agcgcgaggg cgaccgcgtc 5040 
cgcttccggc tcaccatgca cccggacgac gagggccggg tgtggagctg ggtctccgaa 5100 
cgcgtcgccg accgcgcctc cctgacggtc cgcgcccacc gcgtggagac cggccccttc 5160 
cagttcatgg acatccagtg ggtgtacgag cagacgcccg agggcgtgct gatgcgctgg 5220 

5224 

atcc 



<210> 33 
<211> 30601 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 

gatcttagac cttattcact tgatacgtgt aatagttatt acgatagtat gtttttggcc 60 
gattcctccg cgtcttcttt cgacgacgtg gaggtggagg caaaagcgaa gtagttgtgg 120 
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aagaataaga attatgatta tcatgattat tattcaaatt aactctattg ttacgtaccg 180 
cgctccatgc agacgtttgc caggagacga cgggtggaag gataggaagc gaagaagcgg 24 0 
aagcggaaga cgtcgtattt gaattcgaag atgataatga tgtcattgat gctgatgatg 300 
ttttgttgtg ataatgagat cggcatggag gcatttcaca atctctttgt tcgcgaggct 360 
taatctctga gcactcgata tcgtcttgtt tacgaccgga aagcacgtct tcacaccaga 420 
ttttgcggcg ttgaactccc ccgccacacg atacagaaca ctggaattat tattagaagc 480 
ttcaatgatg ttctaagaac ttacgtgagt ccatggagat atctgccaag aattatttgt 54 0 
gagtggtgga catagttctt cattgcattt atcaaacatc tttggttttc tggtttcatc 600 
gcaagaagac gaagtgcaag atacactgcg acgacgccat cccccaccac aagaagcaga 660 
gcactgaaca atcgtacatt agtaaattct aaatctgaaa attatatatc ccacttttga 720 
ccaatctcca ataatccagg atccaatatg ttcttctccc tttggacagg gttcaagtcg 780 
gcaatttctt gcacttgttg"gtctcttttg cacatcacaa tcaacatctt tcaaaatcgt 84 0 
ccgaccaccg tcttccgcac tgacgcatgt aacatttcta ctttgttgaa catgagttcc 900 
acaagtagct ggacactctt cccattccgc cattttccag tatgaacaat cacgaaggcg 960 
acaatttctt gttgatactt ccttatccaa atgattgcaa tactcatcag gaagatcacg 1020 
aacatgatct cggcacttga gaagacgacg ttgagtacca ttaccacaag ttgctgaaca 1080 
ggctgtccat ggtccggttg cccatcggat tggtggtacg tcggcttgaa gtttttgaag 1140 
tactctgggc ccatcacaag tatctttttc acaagtcttt tttaggcgtg gacgagtatt 1200 
ctgaaatgat attttcgttc aagattaaaa gcaaactgaa acgtactcga tcacaaaaat 12 60 
attcatcaac aatagttcct tcagatccac gagtacacga aacacttcta gtctgaattc 1320 
ctgatccaca agtgactgag caaggggacc aatgacttgg tttccaagat gtacatggca 1380 
gaagatggca agtttgacta gtttctggca ttttggtatc tccacaaaaa gaagcatcaa 14 40 
cagattgttc gcggtatatg cattcggttg tacgttcccg atgaccgatt ccacaagata 1500 
cagaacactg aaacgtattt atggtattga caacagcaat tctggagtat ttgaataaac 1560 
ttacagcact ccaatcagta tttctccaaa atgggcaagt gcctaaatta catgtctgat 1620 
gagaagcagg ccgatcagat gcagtaccac aaagtgacat atcgacttca gttccatttc 1680 
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cagaaacaca tgaaactctt cttgacgacc atccatcctc acaagaaaca ctacactgag 1740 
accattctcc aagtttatat tttggacatg attctctatg acatggtttt gtaattatct 1800 
tttccatttt aaggcaacga tgttccggta gtactgatct atgacgatcg gtacaattag 1860 
cgtctcgata ctgtacacca tctccacact tagctgagca gtcagaccag actccgaact 1920 
gccaccaagt acaagcatgt tcattacaat gttcttgtgt ctgtgctgga ccacatctgg 1980 
atgtatgtgt ttcccgatcg gctgcatcca aacattgagc atgacgcatt ttgactccac 2040 
catcacaact tcgagagcac tctgaccaat gcccataaac ccatcttgga catggaattc 2100 
tgttacattc ccgttctgtc gcctctttct gttctctgcc gcacaatgac tcatcaactc 2160 
gacgattcga atcatcaacg caatatgact tccgatgcat ttttccattc gatccgcaag 2220 
tttcagaaca tgaagtccat tctccatagt tccattttct tccagagcag tcaatgtaac 2280 
aactggcaat atcggatggt tttgaattac gatcacatag atgttcggat gctggagttt 2340 
gacgatcacc ctccattttt acgcaagaaa ctcgttgacg tttctgtcca gatccacatt 2400 
tggcactaca actagacaca tcttcagtga tccatctgta aatataaaaa tttattatag 2460 
aaatctaatg aaaatatgta gtttaccttg tagaacaatc tatattgcac attcgtgttg 2520 
cttgttttgg tttgagaaca ttttgacaat ttctatcatg actttgacga tgagtcgaca 2580 
tgtccagaca cattaatttt tgcgattgct gtccacgaca ggctctatca cattctgtcc 264 0 
aagtatccgt aactctccac aaatacaatg cactggatat tggccgaatt acagcatttg 2700 
gaacagccgc agtcatgtac tcatatgaga tgtcgggtgg atgactacca acagaaagaa 2760 
catgaacata aatgtcactt ctaatcggac cagttccatt tatccgttca ataattgcat 2820 
cagaaccaga atattcgaga acagtgtctt ggaatgcaat ttgttggcga gccagtgata 2880 
cttggaaatg accgttaagt aggaattcac cattggcggc acggagagct gaaggttaaa 2940 
ataaagattt tcatgggtat tgataacaca agggtgagtg atgaaaaaag taaatgttcc 3000 
aaaaacactt tgtatagaaa ctcacaaaga taattgtcat cttctttcat attattatat 3060 
cctttctgcc ggatatcaat atttgcagaa ccagctggaa tcttcattac ttcgttataa 3120 
ccaaaggttc cttgctcatt aaatgttcct ttgacaacct tacaggaaga atcatcccca 3180 
ccgcaaacac cacatttgtc tcttcggaga gttgaatgaa gttgatgatc acagcctgaa 3240 
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aatccactta ttttcaattt tcttttgaaa tcagatcaat gttacctgct ggcatacaag 3300 
ctccagctac acaaatatcg tctccatttc tatcacatgg tgttccatca acaactttat 3360 
ctcgaagcag atagaacgct gcagatccac tgagccgaca atacagcttg caacgttcat 34 20 
ttggtgcaac attcgcatat tttggaaccc agtgagtatt cgttgaagcg acaccttgga 34 80 
ttccaatatc tttattgttg aattcagaac attgaacttc acggtatggt tgagtatccc 3540 
atgggcattc ttgtgtatta catgaccgat aacgttctcg ttgaccaaca cagtactttc 3600 
caccatttcg aggtctaaag taatatggga aaatgtcatt ttaatattga taggaaagct 3660 
tagccagtgt ggcctaaaag ctggaagttt ttttaaagat gcgttttcta tcaatttaag 3720 
ataaccggct acttcaggtg attctataaa ttttataaag cttggaagct aggtaaatct 3780 
gaaaagcctt aaactatctc gaagcggccc gaaagcccag aaaagcagag acggacaaac 38 4 0 
atttaagagt gatcagaagc actccatacc ttgatgttac atttgatttt agtgtttcca 3900 
cctcgttttc acttctgaac 'tcgccgattg aaaatatttt gattgaatat attatttgct 3960 
ttcagactat ttgatatcat ttcgtttggc agtttaactc actttgggct gtcacaatct 4 020 
cttaatcctt tttgaacacc accaccacaa gtacgactgc attctcccca tgatcgccag 4080 
tcaccccatt gtccgtcaat tttggtaagg gattcggggg ctagacgaac acaggctcca 414 0 
tgatgacaga actaaatatc caagttttta tgagtttctt ttgtgattaa tttctgagat 4200 
actcaccatg cttcttgatt cgtcacaagg agttccgtcg gcccatggca tatgctgagt 4 260 
tcgacagccc atctggcttc cgtagaatgt tgcacaccaa agacggcggc atgtcggctg 4 320 
taaaatatca atgt-ttcatc ttaaagaata tatttaggca aactaaccat ataagggcac 4 380 
aactcagaag ctggtccaaa tacaaacttg cactgttgat gagcatcgta tttctttcct 4 440 
ggttcatcac gtacaaagac atcctcgtag taacgacgtt cgaccggctg atcgaataga 4 500 
cattgagttt gacctcgatt atttctgaca aattgacaat taaatagaat caaaatttta 4 560 
atagctatct tactcgagga atcgttcgag cattccagct gaacatggcg accaactcca 4 620 
tggatgagtg ttatattcca acgttggtgc cattatgtgg aagttgttct gaaactgcgt 4 680 
tttatcaaat ttagtgcttt ggaacttgca aaccttatta accggcatgt aggtagagca 4740 
ttttcgttcg tcatcatgag gaatcgaaaa cacatgaccc aattcatgag caattgtgaa 4 800 
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tgcagcactc aatccattgt cttctatgat 
tccaagttca gcaagtccaa gtgtatcgca 
cgtcaaaagg attgcaacgt catgatgttg 
ctgctgccat ctacagaaat cttgaagtgt 
agcgttttcc gttttcaaaa cgatcaactt 
ggattggtga cgatagatgg aggcaactgt 
agatcttccg tgatattcgt acatttttgt 
atcccaagag ttggcagctc ttcgggatct 
tttcataaaa ttatttaaac atttttttac 
agctccactt tgacctatct ttgttgtcat 
aatggggatg, caagactaat aaaaggtatt 
tatctgtttc attgattcaa ttttcagatg 
cataacatag aacagcctgg cagcccgtga 
gcattcgacc acacacaagt gatccattat 
tagctatagg ggacccacga gcaaaattct 
tgggctatta cacaacaccc gatgggatca 
aaaaatattg tgataacttc tcttttttct 
tcagattgaa atctcgaaaa atgcatccgc 
tttctttgtg tttgtcctct ttttcccgga 
atgtacacaa aacgtacgtt ctgcgcaatc 
ttctactcca taaatcagtt ctctgtctgt 
tcacagtccc cccaccaagt tcttgtgtct 
agctctcaag cagacactcg agcgccagtt 
tcgaatttga accgtttgtt cttgttctga 
aaagagatct ggataatcta aagaatctag 
cttcccactg atcttgccac gtggcaaccg 
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tgcacaactt ttttgcatat cacacattgt 4860 
ttttccttgt gatcgacaaa tatctttacg 4 920 
gacactcgaa tcatctggat cattgtaata 4 980 
ttgttgagcg ttctgagtga ttcgtggtcc 5040 
gacaacaacg acattgatag atgcacgaag 5100 
ggagaagaga gtgagaacgt agtcttcaag 5160 
atccgccacc acaaggactt caacatagtg 5220 
tgctttgcgt tctataatta aatccttttg 5280 
tgtatccttc ctattaatct tgcaccccag 534 0 
tgactctatc aaaaactgtt caactatgaa 5400 
ggtaactggt tccagtagag ctttttttac 54 60 
tttatataac catcttaacc gttcaaatct 5520 
aaaggtgctg aaatcccagt aatttcaatg 5580 
cctttgctct -tttacttcgt taactaccat 5640 
atagtttctg tgtgtgttag ggtgttttaa 5700 
gcagaatctg agatcttttg ggaaccggaa 5760 
acatttttta cagaactagc aggtaaactt 5820 
ctactcaaaa agtcgttttt aaaatgattg 5880 
cgtacgcaac acaaaaccgc ttgcgcgagg 594 0 
ttttccctgc agctctctct ctctcacttt 6000 
ctcccaccac ctaaatcatc atcagcatca 6060 
tctctgacct ttacacgtcg actagggaaa 6120 
gaaaaaaata gtgtgtccaa atgagcagtt 6180 
cataaaccca aaaaaacgaa ctaggcggca 6240 
acaaatttca gaagttctta ccaataacat 6300 
tcgtctccgt ctcgttgaca ctggtcgagt 6360 
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taagatggtc aaacgatttg aagtgcattg gatcgaactt tcggacgaga tgttgcctat 6420 
ggcgacttgc tccgtcgtgc tctagatgtt taaagtgtca gagaaagtga ttacaaagtt 6480 
tctacctgtt ccgtttccac taataattgg ctcaaccgta tggattccgc tgggtagtgc 6540 
aagcattccg tactgaaaaa ggcttttatt caccaaaatt cgaacttata caaaccaatc 6600 
cgtcttccga gtcgcataaa ttgacgatgc tgtgctgatg tacaccttta acgtgtgcac 6660 
ggtagataca atcgggatct gttcgagaca ttccacctct aacctcctcc tccgagtcca 6720 
aatataagac catcggcgcg aaatttgaat tggaaaagtg gggaacactt ctggaattga 6780 
aaattaatac gactgttata ataaaattga aatctcatac ttgttatgtg agtccggtat 6840 
ttgattccat ctgtgcaaat gaacgatgta gacggcatca tctgatcgta atcgtaagtg 6900 
acaagcatgt ccacagtctc tggcaactcc ttggagtcga cgtcgccgat ctgttgacgt 6960 
gacatcacgt tttccacgac gtccataaga atctcttcgg acgatgtgat ggctgtcgat 7020 
gacgtgtata ccggcgtctt gacgccatcg actgtgatgc actggcacac ctgaaactta 7080 
gaacattatt tcacttcaaa actttttgga ttgttacctg agtacttggc cctggagaac 7140 
agcacatctg atgagaattc tgagatcgtg ccactccctg ctgaaaggaa catttggtta 7200 
aaaacaaaag ctgataaatt aaaataatta gataaaaacg aacattgcaa cgcataaacg 7260 
aggcagacga cgaggagtat gagagcggcg acgacgggct gcagcagatg gaatgagccg 7320 
ccgatggagc gcataccaac agctccgtga tgatgattat gattgtgtgg agagcagcaa 7380 
agaaaaaaga gatggaaaga agcagaagct ccgataaagt tcgtccgtct cttctgaaac 7440 
cttccaaaaa ctacctgctc gaggtgaagg gaagtcgtct gattgaactg ctactgcttc 7500 
tgatcttttg ataatctccc gagtttgtgt tttcgtttag tcgaattaaa attgtagatt 7560 
gtggaatgag cacttgcaat agggaacaga gcatcacaga ctgaaaaatt aaaaattatc 7 620 
tagaatgcaa gcaattttta aatttgtttt aaaatcactt attctgacgc catcttcttt 7 680 
tccgatttgc gcagaataaa taaaaacttg actgtaatat tgggaaaatt tcgaaaaaaa 7740 
acaccgttaa gtctgagccc acctttcgcc tttttttgtt gacgaaaaaa accaaacaag 7800 
ctttaaattc ataaaattcc caattttaaa aacatctaaa gtcaattcct cccaataatg 7860 
catttgtata tgaacaaaag tctgttgacc ataagtcgtt atattactac aagcaattgg 7920 
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tcatcaacaa acctcataaa aatcagtttt 'gaacgggagc aatttatata aactctgtgt 7 980 
gctcttttgc tctttttctt atttcttagt tgtcttctag ttccgccacc actttcgctg 8040 
ctcttgacga aatctgtaaa ttgttcgtca tttttgattt ataagatttg tttggctctc 8100 
ggtaggagct ctcaagctgc taatagtcct atagtaaagt actaaaaaca caaagaagca 8160 
gatgaaggtg tcataaaaca ctgataagaa tcatcatgat taggttggtg cagagaaaag 8220 
aagaagaaga aaaaggagat ttagagaaga gaaacaagaa taaaaatgca aaaataaaaa 8280 
aaatagtaat aacaatgaac gcagagtctt ccatgttgga gaaggaacag gacccatgtt 834 0 
gatgtgtatc tgaggggatc caatgtgtag tgatggtagt aaacacttga gagggaactt 8400 
ccacccccga ctagatgatt ggaagcaatt gatgatagat gtagagccaa agaattggga 8460 
cctactaatg atctagtcaa gattcttctg ataagagaaa aagacaagga agaacatgaa 8 520 
aatgactggt gattgaaaaa taaaacggtt tatgaagtcg gggtgtacta aagatgcaag 8580 
gtctcttgtg acgtattttt tcttccaggc acgttcgcgt tattcacgat tttatgcaaa 8640 
caaggtaagg agtgttttga attttgaata taaaaattta aaagaaatta aagttagaca 8100 
tttgaaaaat tagacaccct catgggaaaa attatagggc gaggagaggc ggtgagaggc 8760 
gccctaattt ctgctcggtc gggtagaatg tctaatctaa atcctacctc atgtttggct 8820 
ccttcttaaa tcaaaagctt aaggtcatct ctgaaacgtg cagttgacaa gttcaatggt 8880 
aagaacaggg agcaagcatt tacaacaaaa aagtaaacaa aaattgcatt tgtcgcagtt 8 94 0 
caaaatggaa caactcactc ccactcgaga acgttttgaa ggggagagga agaagaggaa 9000 
aatcatcaca caggcacatg gaacttctgg gacacaaaac aatacaaact gggtgccgtg 9060 
aatctcagta cacacacaca aaaatcaaaa aagacggaaa ttaggagcag atgtggtaaa 9120 
gggtggttca atgctgatgg gagagagagg gagaaacttc aaaaaaagaa gtttagattt 9180 
atgttggcta tttcaatcct aaatttatct aaacaattct aaaaatgctg gttttggaag 9240 
gttatctggt aatggtgaag ttttataaac aaaacaagac aaacaattct tgagatctta 9300 
aaaatcttag cgactacaac aatatttagg tattttttaa tggaaaaaag tattgattgt 9360 
tgacttggga aattgaacag caattttttg tacttttaaa tcagttatat tttaactttt 9420 
tagagcacat ttcgtagaca aaagggaaaa cgattggtcc aacatgtgaa gatgatgatg 9480 
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tcaacaagtt ttggatcgga gccaaaaaag 
aagaggtgca gcaacaactt ttatcaatat 
cccagccagt aattcttttc cgtagagttg 
gatcaagaga cgcagactat caaagtaaaa 
aacggaggaa aaaaggacga tgataagaga 
ggagagcata ttcttttgag aaaaatggga 
gtcgagcggt cggcgggaag acaaaaagtg 
atggcaccca ccgtggaaca tgatcatgat 
ttttttgcaa cggcgcgttt tggaaaacta 
tataccggga tgtctaagaa ttgtaaaatt 
caatttattt tggttatttc taagcgcctt 
attatctaaa ctgccgttct tcctatattt 
gactatacct cgattaatca taaaaatgtc 
aaatacaaga aatttatcaa ttgccaaaaa 
aatttgtcaa aaatttagtc aattggaatt 
gaatgatgca gctaattttg cagtttaagt 
aaaatatgaa gaagagtttt acgaaattaa 
tatgaaatct gatcccgact aagtttgatg 
tctatagaac actaaataaa tacattcact 
accgtactca cttttgactc aatgacccgc 
tctgaaacag caataatcta acaagagaaa 
accacataaa ttgacatgca tgatagatga 
tttccgagaa atcctcatag tttttggtaa 
actattactt ctaaagactt acctgattag 
aatttgagaa gttgaatcga gtttacgatg 
tcaagcatag aaaaaatgga aaaatacaag 
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aaacaaaaca ttcataccat gatgggaaac 9540 
tttgtttatg ttttgattat ttttctggca 9600 
acctagaaaa tgttggaggc ggagtcttag 9660 
tgagtaaaag gaagtgatat aaacttagga 9720 
ttgaagactt ggaagagtgt gctctttgcg 9780 
cctaggggca actgacgcaa ttgaaacatg 9840 
aagaaggatg ggcaagaaga agcaagagaa 9900 
gattgagagt gaaaattgga aatctcgaac 9960 
acaaagttga ccaaaaaatt attttacatg 10*020 
gagtgatcct ttctgtgaca taatttaaag 10080 
tttatactag catgttatat tgttaatttt 10140 
attattgcac cccctttgtt cattctgaca 10200 
acaaaagaat aaaaacaact aaaattaaga 10260 
ttcggccaat cggaaaaatg cttggttgcc 10320 
tgtcgatttt ccgaaatgat atgaaagttt 10380 
ttacattttc aagtttactg taatttttcc 10440 
aagataataa aaaagcaatg caaacatagc 10500 
gacataggat taataatatt agtctaactt 10560 
ctcgaaactc tcccttttct gccatcaact 10620 
aactgtcaag atgagttagt ttcaagattc 10680 
ctgaaaaaat agagtaaaac taataataat 10740 
ttttccggtt ttcaacaaga aaaacaacaa 10800 
gaaaaaataa attgatagtg atacggtatg 10860 
aaacgtgtag taattgaaga agaaaagttg 10920 
tctgaaaaaa acatagatat tatggtaaga 10980 
aaaatagaga ctagagattg cataggtttt 11040 
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gcggtggcga aaccgcacac atttttgtct gtgttatctc taattttacg ctctcggtgt 11100 

tctctattta ctgtccagaa gaatgaagaa tatgggggaa aagtgcgcgg gaaaattgag 11160 

agaccgagtg atgagagccg cagttttgca aaactttttc gggcaataat ccgccggcga 11220 

gtactacgag aagcacacac acatacgaaa actgttgagt taaaacctaa aaaattgttt 11280 

cgacatattt aattttcgaa ctaaagttta gagggtctgt gcgtgcattt ttgaattttc 11340 

caaacaactt tcagttttgc ggaagaaaat tacagcgatt ttttcgaata tttctgaaaa 11400 

caacactatt gcgtatcaaa aatttttcga tttgccaaaa ttcagactaa gttttggtgg 11460 

ttttggtttg caaacattta aaagaactca aaaaacattt ttagatgttc gaaaccgtac 11520 

aattgtagga tacaaatagc tacagaacaa ttagaatata aaatagagtt gtcaaacatg 11580 

tttaactaat acaaaaacac agaaactttg aaactcgaaa tttttatatc aaaattgaaa 11640 

aagcttgtaa aatttaaata tggatacagt acaaacaata taatcataga tcaaatagtt 11700 

catttattta tatatcttgg caaatcaaat cgtatccctt acccactcat attcgatgag 11760 

tctacaatta aatcagttgt tttttcatcc tcccggacta ttagtttaac ttccacttga 11820 

acaagggcaa agagtacatt aggaagagtt tatgatgaca ggaaaaaagc tatgtaaaat 11880 

gacctctttg gattgaaaaa gcgaacgaat tgaggtttag gacccccgga aaatgaagaa 11940 

ttcgtggcct cgagaatagc aaattggcgg aattaattat ccgtaagagt gtgaattgga 12000 

aacaaccggg acgaatggat tactgaatca aaaatgaaag aaagaagaga tgaaaatacg 12060 

tgtgaatcgg atgaaatgtg atgattttag aataacctaa atgcaacaaa acgacgtaaa 12120 

gacgcggaag aacaggaatg atcaaggggt acatcttata ggggaaaaat gcactttttg 12180 

tgctccaaat gtgagagata atcaggtagg aagagacgta gaataggaac aggaaacggt 12240 

aacgatagtg cgcaggtgct tgatttctgt gcttttgcat gtgttccgat ggaatttttg 12300 

gaacttttca aggggtttcg gaaagggttc gagatttcgc atgtgagctt tggaagaatt 12360 

ttggaagaac tttcaggata acatcgctca agcttgtttg ttagatttca gacttcaaag 12420 

tatataccga ttattgaaac attttaatcg tttcttacta ttagtaaagt ttaatcacag 124 80 

tttgaaaaaa aaatcacaat tttttcaatt atttagacca aactaattat ggtacagaaa 12540 

ataacttgca acccgggtat ttcattctaa tttttttcat ttggaaccac tagtttttga 12600 
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aatagaaact cgttaggatt cttcacatat tatcataact atcagtattt tgttgcacat 12660 
cagatctaag ttcagtctaa ttagaatcgc aaatttgacc atcacacttt aaaacaaatt 12720 
tacttaggca cagggcatcc ttctaacttt tttgtccccg acaaaatgat gacaaaaatg 12780 
acgtgaggaa tcaaggagaa aaaggaaaag aacaggaagc gaaaagtagg agaagctctt 1284 0 
gatttctgtg ctcattcctt gttcggatga gctcactgtt tgcaacattg gcgttggtgc 12900 
gcgggaatcg ccattgccga actttttcaa gagacagaga gagagagaaa gagaaggaaa 12960 
acgttccgat ttttaaaatg gaaaaaaatg aaagaggaag atgatgaaaa aatgaactct 13020 
gcgtgacatt tgttaatatg gaaaaagcat gattacttca aaattgtaca ctaatcccca 13080 
cagcacacat tttgaagact tttttacaaa aacaatggtt taagcaagct ttaaaaaatt 13140 
gatagtatcc ttaatgctta atcatatcca agtttagttt taagttttga tttcaaaaat 13200 
ttctacatca aaaaatcata cttagtgatt atatgcaaaa caatttttaa attcaaggac 13260 
atatttttga tttttggaag gatgataact tttttgtgat tccgaaaaag attaaagtag 13320 
gtttaaaacc tctgaccttc tacagaaaaa acattacctc tatgaatttt ttttcatctc 13380 
gttcagaact tgtctcgggt caagccatga agacatgaga tagggtgtaa aacgttccga 13440 
agagaggttt atgactatta ttgtagttga agagaaaaat gatatctcaa tggatttcat 13500 
acagatggtc ggatttcatt cataaaatat cataagaaaa ggtacgttta tgactgtcta 13560 
ggtcaactgg ttttaggttt cttggaattg tttcaaacat ttttaggaaa tattttcttg 13620 
caaatatcta ctaaattgaa gtttgttatt gtttttgaca tattgtagat tttagagaag 13680 
aatcactcag agcaaaaatg ttgggaaaac gtgagaaaaa tccaagagac aaaagaatgg 1374 0 
tcttactatt agtagatcaa aaaaccagac caattattca tattcctact attcaatata 13800 
tattcaaaaa tgagcaaacc aagaaattgc acctaattta tcatcccaca tatattccga 13860 
cgaaacattc gctctacctt ctttttttct gtctaggaat tataaagggc cataattata 13920 
atttcagtca aggttttgga aaattgttcg actaaccatt atgaaagtta aaaaccaatc 13980 
agtcaaaaca cacaatagga atataaaatt cgtagaagaa aagctttttt tttggtcgaa 14040 
agcaaaatca aattctggaa ctgcgacttt tttagtgcaa ttatccattc aacgcaagtt 14100 
gtctttcaaa atttaaattc cagaagagtt ataacaaaac agacaggtgt acaagtaaaa 14160 
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gaaaaataca agttttatcg taaaaactga tacgaatcta gatacacctg ttaaaaaagg 14220 

ctttctcgaa acccagatgc cgtacgaagt aagcagcagc caactaaaca ttttgagtaa 14280 

acatatggca agtgttttgg cgcaaattgt aaagattttc cgtgtgggta actagaattt 14340 

gaaactgtaa gtatgacgac ttaaccacac aaaatcaaat ttcaaaagat cttaaaatgt 14400 

tcgaactttc aaaactttta agctctctcg catctaccgt agtcttctaa taacaacagt 14460 

cgtaagagaa agctcaaaat ttttcaaact ttttctgaat gacagaatca gttgtataca 14520 

aaaaaaaccc ccaaaatgcg agccccatga acctgacaac cagacaagtc gaaattgtaa 14580 

aatcgtatag atcttggttc acgacatgaa gagcaccgcg ggggcacacg agagcaacta 14640 

ctgcaagcgc tcctgaagag aagaaacatc ttttttccag gaccactggc cagtagtgct 14700 

cccccagatc actttctttt ttcttgcttc atctgatttg tgtctgcgtc gtctgatctc 14760 

tttagaacct atccttcttc ttcttctttt tgatacttcg acatcagaac aacatcgaca 14820 

tgtatcatct tttctctttt ttttttgtta tctattcatt cattcacttt tcatttagtt 14880 

tgattaatag gtgacatgaa ctcttgtcac ttttcaattt caacttctta aatcttaaac 14 940 

tcacagtgat tccagatatg agcaactcca atgaggtgtt gagtagaaac ctaaatataa 15000 

cattttggat gttttgataa tgttggaaca aataaattga aacaaacaag acttgaaata 15060 

gagacaacgt gcagaataat gtctaccagc tggtttcagt ggcatattgt accacgaacg 15120 

tccgacagaa cgaataacat aaagatcaag aaaaactgtt tgggagcaga caaacaatca 15180 

gaacacagtt ttgttgaggg gaccaaatca taattaatga ctaaatttta acgaagaaag 15240 

tgctcgaaaa gaacagaatt tagaagttga tgaacaatat ttttactttt agattaacaa 15300 

ttatgcttta caaatgacat ccaatctaaa gcatctggta atctgaaatt tgtcaaaaca 15360 

gctttcaaga ctagtttcaa atttgtcgat tcaatggatc aagtgtgtaa ttgatccaat 15420 

aaaaaagagt ataaagtgag aaggaagaaa gtgtgaaaaa agaagaacgt gaaacgtgca 15480 

gaagatacga aatgagtttg aagactgcac ttttcgagcc tcgatggtca gtcacttggt 15540 
cagttgcgaa aaagctgtga aaatgataca ttgtgtcggc tctcgtagag aagaaagcca 15600 
catggtcagg atgactccaa ctgggatatt cagttgtaaa gaacacaatt -gatatttttg 15660 

catctttttt aactagtttt tacaatatga gaaattgttc tgtgcgaaaa atatgacttc 15720 
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ttccttgttg 
tcagcagaat 
aaaatttctt 
gagcacaaaa 
aaaaaactac 
gaggcgggca 
tacat acaag 
agttgtggca 
aaaatt caca 
cggacgcgga 
tctcatcatt 
acactattct 
ttgttgataa 
catttttcca 
tattactttt 
ggaaaaattg 
agcaggagaa 
caaacaatgg 
aatgtattag 
agtttgtctg 
cagtgagaaa 
atgtgaagtg 
gactatttgt 
gaatttataa 
aattttgaac 
aggtgttttt 



ccgaagtgta 
gtgttcttac 
caaaatgttc 
cacctgccga 
aaaataacca 
acaagcgtcc 
tttctaaaat 
atgaaatgtt 
aagaaatgtt 
tattacacat 
tttttccaaa 
taaaattaat 
aatagtgcca 
ccaaaaaaca 
ttgacatgaa 
aacggcgtat 
gcggaccgaa 
actgttgtat 
aataggtttc 
aagtttatga 
aacacgtcat 
aattacgtaa 
ggaatgaaac 
ttaccggaac 
ctgcaaaaca 
tcacagaact 



tttccctgga 
attttgttgt 
ctacgtcttc 
cgcctctaaa 
atacaaattg 
tcatgacgcc 
ttaacttaca 
ttaaaaatct 
acaatctgtg 
tcagtcacac 
gtttattcta 
gtataatagt 
aacatcctta 
ctgaaatggt 
attttcaatg 
tgcaaaaatc 
gaattcagaa 
aaaaataaag 
actacctgtt 
cgggaagtgt 
tttggaacgt 
taaaatcagt 
aattgggggg 
aaaaatatct 
taagcacaaa 
ttaaacaaca 



-128- 

aattccagta 
aataataatg 
tatgcacatt 
atagttataa 
agtagggcga 
ttgttcattt 
aaatttaaaa 
aaatattgag 
aatgaagacg 
aataaacgtt 
aagttcaata 
ttagaaaata 
tgttacgcag 
ccataaaacc 
atgtaatgta 
ggtgtgcgga 
gcttttaaaa 
cggaggcggt 
gttgaactca 
ccatcaaata 
taaaatgggt 
tttattaagc 
gttttttttt 
ttaaattatt 
ataatggagt 
aatactcata 



aatacctaat 
tattaaaatt 
atttaggtca 
ctgcgcatga 
tggagaggtg 
agaatgtgtt 
aaagtcacaa 
ttttaaataa 
aacaatgaaa 
cggacactac 
ttttagtttg 
ttttgaaaca 
ttatccaacc 
tattcaaatg 
aaacaaagaa 
ggaggagaag 
taagaacggc 
agagagtcaa 
aaaaggtgtg 
actttcaaaa 
ggcaccgcaa 
ttattaaact 
ccaattttcgf 
aagatttgag 
ttttgtttta 
atttgaatga 



gtaaaaaatc 
gcattaatta 
cagtttcatg 
aatcaggtag 
ggcggttgga 
tgctttgaat 
caataataaa 
atgatttttg 
aagtgaggaa 
cacacatttc 
attattttgg 
tgaaactttt 
acatttttct 
gatatgagaa 
aaatattgcg 
gaaaaggaag 
gactttcaga 
agctttcaga 
aaaaagtgaa 
tttgacttat 
aatgttcaca 
aacccttccg 
attttttttt 
tgatgtttga 
aaatatcaat 
aaacagtaga 



15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 - 

16680 

16740 

16800 

16860 

16920 

16980 . 

17040 

17100 

17160 

17220 

17280 
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tcccacaata 
aaaacaaaaa 
aaattgggca 
gtgttcctag 
taaaacgaac 
aatgggcacc 
tggcttctag 
gaagagaaat 
gaggaacgga 
aagttatggt 
ggagatttca 
aaaacagtca 
atgacacaca 
tatagaaaga 
tagaaaataa 
gaacttgcaa 
tataagtctt 
tcaagaagtg 
ttatcaaaat 
aaaaccacct 
tctttgtcga 
tgctagggtg 
aattattcaa 
gagacaatga 
gctgacattg 
gaaattgaga 



ttttgaaaac 
gaaaaaaaca 
atatggtgaa 
tggaatgaaa 
agcggaagaa 
tatcatcatc 
ttcaagaaac 
atgagaaaat 
aacgttatct 
atctttgaaa 
ttcgaagaac 
atcacaggaa 
atatcacaga 
gagggtgaac 
caatttttga 
atttttagaa 
gcagtatttc 
ttcaggaaat 
tttagaggtc 
agtgtatggt 
tatgcgggtg 
tcaaaccaaa 
attctcgtaa 
tcgaaagtta 
gaacatatca 
gtagaaaaga 



ttatctatat 
aataatttgt 
gaaatacggt 
ctaacagaag 
aagatctagt 
tcacttgttt 
gaaaaaaaga 
gatgatgata 
ctgatcgcca 
gaaaagaaaa 
ctaacccaat 
caggtgtctg 
caaaatgatc 
aggagataga 
acaacaaaga 
ttatcatata 
ttaaatttaa 
tttaggacat 
tagtacacat 
aaagtacatg 
atggtgtttc 
tttatttcaa 
tttaatgaat 
acaaaaaaaa 
aaaagtggta 
tcattgaagc 
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atatatatat 
cagttgataa 
agttcgtcgc 
ctatacggta 
ggtcacttcc 
atttgaacag 
gcaataacgg 
ataataataa 
tctcattatt 
caggaaatta 
ttgaactcac 
tcttttcaaa 
aatgaggttg 
gaatcaaaat 
aataatttag 
aaaataacaa 
cagttcattt 
caaattttta 
ctacccaaaa 
agaagtgacg 
tgtgagcagt 
cccattcttg 
cactcagtaa 
ctgaggatta 
agtgaaagtg 
agaaatatgg 



atatatataa 
tttttagata 
actgtcagac 
tataatatta 
gatttctcag 
tctcgacttt 
aacagaaaat 
gttagaagag 
attatgagac 
tacagaacac 
tcccacttcc 
atgtatacgt 
cagaaaagaa 
ttgcatagat 
tggcatataa 
tgtttctata 
cttggtaatc 
tttattttct 
agaagacttt 
tgtctttggg 
aacaggaaat 
cttcaaaaaa 
ctgtaacgtt 
aacgttattt 
aaacgaaaag 
aagtgaattg 



ttacgaaaaa 
tgagttgcca 
taattttcaa 
ggaacacaat 
ctgacttttg 
ttccaattgt 
tcagaaagtg 
ggttatcgat 
acaaagatgt 
acacaatttc 
tcttgtctat 
tttccgaata 
tgcaaaaaaa 
aaatatgcaa 
tatagcgatg 
ttttatgccc 
tttattttta 
aaatctactt 
ggagctctca 
cagctggcca 
tctggacacc 
cccccaacta 
ttttttttca 
ggtatctaca 
tgcaacattt 
aaagccgtgg 



17340 

17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820 

17880 

17940 

18000 

18060 

18120 

18180 

18240 

18300 

18360 • 

18420 

18480 

18540 

18600 

18660 

18720 

18780 

18840 
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cgccaaaacg acggtcaggc gccattgaga aaattaatga gagttcggaa ggttgaaaca 18900 
acacaaagac aacgtgaaaa attagtttgg agaagataaa aaatgtctgg agatggacga 18960 
tttcttagtt agctgagaat agtttacatt gattttcggg aaaacgcaga atgttagaaa 19020 
aatggaaaca tgtctagact tcagataaat ttgtagaatt tatatttgta gcaaaagcac 19080 
actaacaaag gttacaaagc tattaggaaa aatacggaat gtatttttga aaatttttga 19140 
tttctctaaa ataataacac cattaatttg ctatatttgc tatatatgct atatagtatg 19200 
ttcgcattac tgagcacaaa acttggaaaa agtttaaaaa aaaaggaaac ttgttttctg 19260 
gagaaatcat taaaaacagt acaatttcag acagaaataa atctttcagt gaaagctttt 19320 
ttttgagtaa gactaagtat gcactcacaa cttttctgag tgttccaaaa atgtttaaag 19380 
aaaatactag taaaaatgag catttcgaaa agcaatatat catacaacta cacaaacatt 19440 
tcaattaaag gaatcaattt tataatagtt ctaggcaatc ccacttttag attcaatttt 19500 
ctagcacagg gagcattgga agatataaaa acataaagat aaaggtgata aaagatccat 19560 
taaacacatc atatctatca aaccatcact tccatcaaat ccacagattt atcacaaatc 19620 
agtgtgtgac aaatataccg taatattaag ttcaaatggt ggaaaagacg cagacaaagc 19680 
ttttgcataa atactaaata attgaaagaa acgcagagaa tgtaagagaa aaatatacaa 19740 
tatgtgtatt atcaaccatc aacagttttt gattaaaacc atggagaagc gatatacagg 19800 
agcaaattag gagacgcaga ttgagaaaaa atgagaaaat aatgaaagta cggaagggtt 19860 
attgtacaat aagacaggta gcatctctca aagaacctat tgtcaagcag tttaaacatt 19920 
caacaacgtt catttatttt ttagccttca ttatgatatc tcattggttc tataattgga 19980 
ttttttaaat tcagatttct cattcatgta caagtaaagt tgttaattgg ttattatgcc 20040 
caaagtttaa ttatttgagc gcagaaaatt tgaatggaaa tttcagaaaa ctgattcatg 20100 
ctaacttcaa aaaatcctga ataaatacca attcttttcc aagtatgatt ctcgagcctg 20160 
tttacgtgcc tgcctacggt ctattttcta atttttttaa tgataaaatt ttagagtaga 20220 
tcttcaaaaa tcttccttaa aaaatctcca aaaaaatcaa gttcaggaaa actaaagtac 20280 
tccaataaaa tactcttatg caaaaacccc ccattcattt tgcagaaaaa gacaaacaag 20340 
aattaaagat aaaaagttat gatagacagg aagctgattt attagatcaa tgaatcgact 20400 
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tttagttttt cttgaactct aatttgaaat agtattcgaa tgagaaaatt gaaaatatac 204 60 
aaagatcaaa agttataatt gaaaatcaac aaattgatag tgtttgtata ggattaaatt 20520 
aaaatgtgcg gtacatgaga cagtagtagt agtagccata gtacgtattg gtggctccac 20580 
tcggctactg ataatttcct tttttactga taatttgatg tcatttcgta attttatttg 20640 
tgtttccaaa aattgtgggc gtggtttatg aattggtcaa gacatgaatt aaaggaattg 20700 
taaagtaaag aagaaaatga cagaggagaa attattttcg tttgctttgg aaattgcaaa 20760 
ataaattaga ttattaaaga taatagttac ggtttaaaat aaataggtga taaaaaaata 20820 
tccaaaagtt caagtcctaa gaatcttgct attttgcaaa aaaaaagcat gagcttttgg 20880 
cctaaaaatg gcggacagct gtcgggacac tatccaagaa ttcgtgataa acgggtgaag 2094 0 
caccgtctct tatcatcatg ccatttttcg aattttaaac tcagactttg ataaagaaaa 21000 
ttaaaaagag agagtgtgag aaataagagt acacatggaa aatgcaagat ttgaatttgt 21060 
ttccaatttt taaaatgtat ttaaaagagt taccgttcca tttttgatta gctttataag 21120 
tggaaaaatc gtttttggat tattttttga ggaatatttt tgaatgcgct ttcaattttc 21180 
ctataaaaaa ctttgtgttc acttttttat cccgttttta tttttatttt tacaactttc 2124 0 
aaatttttat gaatgtttta ttgtaaaatc ataaaaaggt gcgaaacatc taaattgcct 21300 
ggattgcatt taaaagtgca ttagcagaaa tgtattccta tggaatgttt tttgtgcaac 21360 
gagatccaga agctcgaaaa acatccaaat ttcttccaag aaagttgatg ttccaaaaat 21420 
aaaaaagatt ttagcccaat caactaaaaa aaaactctcg tttttttcat atttcacatt 21480 
ttctggtcac tttgaaggaa acactaatcc caaactgaga accgaacatg gattaaacca 2154 0 
tcccatttac tatttcttgt tgtcttcaaa aagtcttaga attgtgcaaa aaatagaatg 21600 
tttcgaaata ttgcggtttt cgttaaaacc ttttttgagt agattgaggg tccattagaa 21660 
ttcccaagag aacttgatga ccttcatcat caaaattagt ggtcattgaa tgtttgatca 21720 
gacaaaaatg gaaatgactg aatcggaaag agcaagaaaa tcgaaaaaaa aagtatttgg 21780 
aaattctgga aaacttttta aaatttaaga agggcaacga taagaaacag gaaattaggg 21840 
attttttagt gatggagaag tacgtgataa ggttaaggtg gaacactagt gcacacgttt 21900 
tgaatacact acgtgttttt atttatggta gaatatagca cttaaagaac gtttttaata 21960 
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caaactgaaa taaaaatacg gaaatgtaat 
gaattttcac atcaaacggt agtgattctc 
gtcttaaagt tttcgactat aatttaagta 
tgtcctatgt agccttttct acacttgaaa 
aaattcaagt agaaagttat ttagtcttgt 
gcaataaaaa gtcagctact gtttgattta 
gttactcccg tagtacggat acagagtgaa 
agtgaattta cagagaaata attataaaac 
tgcacagcca tattgttaca cttggagtta 
caaaaactct gaaagtcaca tcaagaaatt 
ttgaattgca ttgattccca agagaaatag 
ttacatacaa atatacacac aaaaaagagt 
cacaacaata tcgttccctt ttatctctaa 
cttcggagca tcttgtgctt gctccttgta 
tatgtgtgtt taggtgttgt ttgaacaaat 
acctttttct tatgcttctt cttgttttgt 
gccatagaca aggcggcgac atttttgaaa 
aatacaacga tcacaacaac aacacaacga 
ttaggaggag ttggagtttc tacaatcatg 
attgctcata aaattgtgat accatgaaga 
catacttaca acctttattc tgacttgaat 
tttgtttaaa agaaaattca attaggaaat 
gaaggcacgt gtaaatgcca cccggaaatt 
gattgtgaat tgaaaattta agagaatcat 
gattgaacag gaaagtgcag aaatatcaga 
cttttcaggt aaagaattat acagaaatcg 
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■tttttttttt gaaagaatcc gcctgaaact 22020 
tttatgcgtt gggtgatatg tatttacgct 22080 
atatgtttgt caaaaatcat catggtgctg 22140 
aatgataatt tttatttgaa aatggtattt 22200 
gtgccaagca ataaacacat agtctattag 22260 
aaaacttaga ctactggtgt gcctgtgcaa 22320 
aactagtgat tgtactttag atcggctgat 22380 
ttaaaatttt tagcagctca gtcttcaggc 22440 
caaattctgc aaaccatcta ggattgaatg 22500 
ccaacaaaaa acacattaga tgccaactca 22560 
tagtaaaagt gacccctatc cattcctccg 22620 
gtagacctct tccttctaac ccaaccaaca 22680 
ttctctgcgt ctccataagc tttgagagct 22740 
cggcggtaca gtttcctccc tctgctccct 22800 
aagtttttgg ccatccacct ccttctcaaa 22860 
gcacattttg gctcttgctt gtctgctcga 22920 
aaattatatt agtactgtta tatagtactt 22980 
aatgaaaaca tgagatcaaa agacaaattg 23040 
aaatgtttat ctagttatta taaaactgaa 23100 
ccgaaaaact ctatgcaact gcatactgca 23160 
ttcagttttt ggtgtttgca gttattctat 23220 
aagcaataaa ttttggcatg tatttcgata 23280 
agaaaaaata agatttctca aactgaaaat 2334 0 
tgcaaaagta cacaaatgaa tcatttttca 23400 
ttaccgtccc aacagaaacc ggaaataaca 234 60 
taataaattt aaaacaaaag agagttatga 23520 
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cacattgcag aacggtctct gtggaaaata ggaggaggtg ctgcaaaaac tccttagaca 23580 
tggtcatact tacaaaaaaa acagagttta actaaaaatt aaattaagtg agaaaatgaa 2364 0 
gaaaatggag gtctttcgcg gattcatttt acttcttctt ttttccactt ttcgttgcaa 23700 
gctttggttt aaaagtttcg caaacaaata aacaatgaac attgtgttga gaagacaagc 23760 
caagtgaaag gaaaccattg agagcaaaaa caacaatcaa ttgaaataaa gagtaaagtt 23820 
tattgaatat actgatatgt gaatactgga aaaataatta gtctctataa ttggtaccgc 23880 
ctggaagatt catttctgat tcccttgtgt ctttgaccaa aactttattt ttttcagttc 23940 
aaaattacaa aaaataaata ctcatcttca tcgattcagt ggtgttttaa actcctacgt 24000 
ttttctttta caataaaggt aatgtaaacg ttccgagcgt gtagttttct ctgaaaattt 24060 
tttaaaaata acaactttat ggtatttttc ttaaagtctt aaactgaaac cgaaacattt 24120 
ttgataggaa aactatttta acattttggg aactcggcaa aagctctgca ggcttgccga 24180 
acaactctca tttgaaagta ataaatatga aaataaatta tcgaagtttt tttttttgat 24240 
attttatgaa tacggctctt ggtagttttt gacgagaaaa ttacatgttg cataaatttc 24 300 
aagagttata actcatggag accctaattt ctggtttcac tagaaaatca aaaaatcaag 24360 
cgtttgagca gaagactgta ggaagagcac acgtcataaa aattagggga tcaacgatcc 24 420 
gaaacgggga attgaaatac gatatgcgat gagttttggt tcgaaccggc tttgtcccaa 24480 
aaaacaacag aacgatggtc tcaggctcac ttgactcatc tcggtgggaa caatttttat 24540 
ttgtttttat tccgtacgca cagaaacttt ttttgaggta tttttgatcg tgggtgggtg 24 600 
gaatggtagc acccaatttc aaatagtgtt tgatttgaag agacaatgaa agaaacaagt 24 660 
gggagataat ggaaatgacg tgatgaaatg gaacggagga aaactggtat aaatatcgtt 24720 
gactatcaaa actacaataa tactaatgga gaaaagttca ggattcttga agattttaca 24780 
ttatgatagt tgggatttac tggtttcaag ttcaaatgtc aaacatctgg aagaaaaacg 24840 
tataagatta catcaaaata aaactaaaat ttgaaggata aagtaaaaca gcataatata 24 900 
gtgttttaca tctcatgtag gaaacgaaca aaatctttga acacctagat aacttcaaac 24 960 
ggaagttggg tgaagaaaag aataggggcc agaatagaag gtcattttga caaagtgaac 25020 
agacaaagac attcctaact cggaggtatt ccaaaaactg ttccaatatt gaagaatgac 25080 
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actatttgat tttatatcat aacattatta 
atcgcaaaat aaaaagtggc cttgatgagt 
ataattaatt ataaaaatgc tgatacttga 
gttttcttta ttccgttctc actttgagtt 
agttcggcgt tttttgtcga taattccatc 
caaatatttt agccagagtg aaatttatta 
gtattattct gttgaaggaa catgttgcat 
tgtttagaac tttataacct cgtttaaaca 
tacatgagtt tgaaacagta tcagatacta 
ctatgagaga ttctgataca ttggtttcca 
ctcccactga attacagatt tctgttctag 
ggcatcttcc tcatttttat cttcagtttc 
tgcatcctta ccatctctcc ctctctcact 
aaatgttttc tgccacgtca tcatcatcat 
aattaccacg gcatcaaaat attcgatatc 
cacttaacta ttttctatgc gttattattt 
acgaacgtta tggtggaaaa cctgaaaatg 
attcaaacga actgtcatta atctttctat 
tccttggatt tggtccaaat gactcaaaaa 
aaccaccacc cggatttgtt gataactcac 
aaaaaacttg atatgaagtg ttgaaaaatt 
ctagttatct tgtatttcaa atattaatat 
gtacttcagt aattaccacc cacattttga 
aacttttatt aggatgtttt ctcattttgg 
ttttatgcat caaattaata tctaattttt 
tttataagat tttctcgcta ttaaattagc 
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atcacatggc ttttttctta ggaaatttat 25140 
cattcattca aaacatgcct aaaaaccttc 25200 
ggacccgttt ttttatattt ataaacagtt 25260 
tttttctgaa aatactaaaa aaattaacaa 25320 
tgattatttt cggttttttt acctaattat 25380 
tcttattaat atgtttttca atttgttttg 25440 
tttaaatctg ttgttaatac agcggccaca 25500 
taaattgtat gccatattta ttgcaagtac 25560 
tattttaaac aaaaatacac attttccccg 25620 
atttttttaa aaacttgaaa ttcctcaagt 25680 
atacctccaa agacacctag attcgacttc 25740 
atcttttgtc taattttccg tacatttctt 25800' 
cactcttctt gttcactaaa tctcaattca 25860 
caatgccacc ttctcagagc ccattcgaaa 25920 
acgaaaaatg cttctcaatt ccacttcata 25980 
tttatttctt tgttttcact atattttatc 26040 
ttcaagttac atcagcaatt tatgattcaa 26100 
ttgattcttc aattcgtcga cgggaaatat 26160 
catcaagaaa tgaaactcaa attgagctta 26220 
aaatttcagt aagtttagga ttttttttca 26280 
gataattggg ccgggcttac atcagagtat 26340 
tcaaacattg tagagattcg aaatgcgaca 26400 
ctgtcaaaaa agttcccaaa aattgtcgaa 264 60 
cacgattgga gtgttttttt aacaaatccc 26520 
aaatcaataa tttggattaa ttcaacttgt 26580 
aaaaaaaaac tatcttcaaa caattagcgt 26640 
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gctttaaaac tactaggcct ttgttggcaa cgtcttttca cattttggca caaaactata 26700 

aactatgctc agaatttggt aatgtttgaa aatgttttgg gcaagcatat agttattcca 26760 

attctaaagt aagattagtc atctattttc cattccattt ttccattttt cacctatttt 26820 

ttccattatt taacaaccaa gactgagcaa acattttcct gttttaattt tcatatatga 26880 

aaagacataa gcaaaagctg gatcaaagct tgggcaaatc ctattcaaag tattttccaa 26940 

cgtttccatt ccctcgtttg taaagtacaa ttggtaatct taaggcttaa ttaattattg 27000 

tgggagattc ataatgtgaa aactaaatgt taagatttgg tcatcaattg aaaaggaaaa 27060 

accccagtct ttaactgtga atgcagaaca tccaaagtca ttgcttttac gagatcacac 27120 

aggacatcca tatttagaag taagttcaaa tcagaaatcc ccaatccatt ttttcttgta 27180 

gttaccactt caagaaccat actccgattt tcgcgacatt gttagttgtt tcagtccaat 27240 

ttatggagat tttgagatgg ttttaacagg tttaacaaat taatttggtt tcttttttaa 27300 

aacatttaat ttttatagct ttaacatcat ccatatcaat gggatcattt gttagtatac 27360 

catatgaaga gcttactgga gagctttaca agtttctacg tgtatttgaa aaaacgggac 27420 

atgtcaggtt aactgcattt ccaatgatac gtcatcagcc tcgcttcgat tcggaaaatg 274 80 

aaaattatca tttgaaaatg atcaaactta aaacagattt aacgcatttg cattgttggc 27540 

taatgcataa aaaccgggcc aaattcatga tcttccaaaa ctctgctgaa attgttttac 27600 

cgatttcctc gacgctggaa aatcccaatt acgcctctga atttacacga atatttgaaa 27660 

caccacgagt tgaaggatat gatattttag aatataatgt caaaatttca acggataaac 27720 

gcttaggcga cttttcggat ttctccatca ggcagacaat tgaagcagca aaagcagaag 27780 

aattaaccgg aaattctaaa acattaatca tgagaatggt atcacttttt ttcaaaataa 27840 

tttactgttt ctattttggc atttatttca gcattctcca actccacaga atctcttaaa 27 900 

acgcggtaaa atgtatccat ttttcaaaaa tttcccatct ccaccacaag ttattccaaa 27 960 

gaaaacattg gacaaattgg atacaataac agaaataatt gaagaatctg atgcattctg .28020 

gacacttatc aaagaatgtt cagaaaattc gaaatcttgg aaatgctcgt caagaaaatg 28080 

tgtaagacca tcagttagac atcgatctct tcatggatgg tattcatatg atattcattt 2814 0 

ttctaaattt ttgaatgttg aaagtttttt ttgttcagat tttcaataaa cttttaagaa 28200 
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aagaataatt ttaaattcta taattcctga atttccaact atgtttatca tttcccaaag 28260 
tacattcgaa aaagctcaat aagcaaaacg accacgaaat aacagtatta aaaaaaaaga 28320 
tgttgtcatt tgaagttctg gagtgcgatg aaaagtctct cacctcggac tttctgtaat 28380 
ttatttagca tacaacatga atttgaccaa ctcgaaataa ggttaagact gaaaattttt 28440 
cacaaaaatt ggaacacttg cgaagcgaat tcaagacttt tcgaagttat taaacaagct 28500 
ttcaaattct cagtaaaact gaacgttttt tttatgctct ccaaatcatt ttaatatggc 28560 
tgctcgcgtc gctgaagtat tttctagagt atgtttaata aaactaatat gtaaatgaaa 28620 
aaccaaaaac tcagataaag agcataactt ttataacgca ttttcagaac tcttcaagct 28680 
ttttcagatc acttctatca gcagtattct tcttttttcc aaagacacca agaactgaaa 28740 
aggttgaagg agcatcaccg gaaatagagg atgactgctt attgttcttc tttttctgaa 28800 
taaaatcaaa ttaaacaccg aaaatatgaa acatattcac taacctgaac agctttcagg 28860 
tttgatttat tctgattttc cgccgctgat ~ctgctctgac-~ttttgaaacc gggacttgga 28920 
gagttaccat tgcgtatgcg agttcgaact ggacgccgat tcttctttct gaataaacga 28980 
attatacaaa tttgtatttg aaaacggaca acatacactc cttcttccgc cgaattgctc 29040 
atcgattttc tcatttcttg tgttttttcc tggcgttcag gttcaaaagg tggagcaact 29100 
ggtttggaca tatacggaag aatgttcgag acttgaatct tttttggttg ctcaatattc 29160 
tccattggaa tatgatcggg aagttcaaag tagctgttgg atcctggagc ttgatcaaat 29220 
ccttcgagag ttaaaagttc acgaactgct tcactcattg tgaccctttc ctcttcggca 29280 
ccagcacaga ttctatactg aaattgcttg ttgtgttgtt ttactcaaaa gaatagtgaa 29340 
caaaattttc tcaccgtaat gaatctgaca atggctggtg ggacgttagc ttcaaatggc 29400 
attcggtatc cgttctgaac acgtggtaaa acctcagcaa ctttcattcc cggataaggt 294 60 
tcgattccat catggtacac ttcccaacac atgactccat aagcgaaaac atcagtcttt 29520 
ggagtataga acccagttct tggaacttct ggagccaacc atctaatagg aactctgaaa 29580 
aatttgaaaa ggttggaatt tttgacgttc tctaactttt tgtgaggatt catccgatag 29640 
ctatagcctt ctcgtgacag tccaaagtcg gatatcttta cttgtccatt cccgtagaga 29700 
caatttctgg acgcaatatc gcgatgaatt atttgaagtg aatgaagata ttcaagacca 297 60 
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agaccagctt gaagaaccat cgtatgtttc ttggaaattg gcaatgaacc aatgttcttc 29820 

tttagatatg aatccaaagc tccattgtca gcctaaaata atttacataa gacatttttt 29880 

cttagtaaaa taaaattaat cagttaatta attaacatac caactccatt atgaccatca 29940 

aaggttcctg tcctgcagcc acaccataaa aagtgacgac attcggatgt ttgaactttc 30000 

tcatcaatct ggcttcgtgc atgatttctt tgatctgctc ttttgtcaaa gattccaact 30060 

ttgccagctt gattgcagct tttttgacgg tatttcctat gcgaatttct cccaattgaa 30120 

cctctccaaa tgctccttct cctaatttct tgattaatgt cacgtcagaa tgttgctttt 30180 

cccacggttc acgaccaatt ggacggatga ttacagtttc tgggccctaa aagcaaacaa 3024 0 

atgaaaataa gtttactcac ttaatttgta agatcacccc agcaacaggt tctttagaac 30300 

gatgatagta attgagaaga tctgcgatac tagaaaacca ttttttatca actgcaaact 30360 

tgttattgtg ctctcgaatt acataatgac gaatctgaaa taatattctt aaaaattatg 30420 

agcaatcgtt ttacgtacgt cctcaattac tccaacatag acagagagaa caaatttcct 30480 

tggctctccc acttttggat cagtaaatcg aactagaaaa tcgcctcgtt gagtgagcaa 30540 

ctgtttcata tcctcacgtg gcaataagcc atggtaccag ggttcttttg caagtacttg 30600 

c 30601 

<210> 34 

<211> 8009 

<212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 34 

ggatccttgg ccacgccatg ggcgatgaaa ttgaccgcgt cgtaacgggt catgtcctgc 60 

tcctgcagga agaaggccgc gttcgattcc cgttccgcaa agatcgcgac aaggacattc 120 

gcccccgtca cctcggtccg gcccgagctt tgcacatgga tcgcggcgcg ctggatcacc 180 

cgctggaagg cggcggtcgg cacggcttcc gagccttcga cttcggtgat cagcgtcgag 24 0 

agatcatcgt cgatgaactc ggtcagggtg gtgcgcaact cgccaagatc gacgccgcag 300 

gcgcgcatca cgcggctggc gtcgggctcg tcgatcagcg cgacgagaag atgttcgagc 360 
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gtcgccagtt catgtttgcg cgtgttggcc agcgccagtg cggcgtgaat tgcttgctcg 420 
agcgtggtcg aaaacgaagg catgcggcgc tcctttcctc gggtctcccg atactggcct 480 
catgtgatta agtttcggtg gatttcgccg cgcttcaagg cccggacgcg tgttttttcc 540 
acctctcgcc gctctgttgc aaaactgacc agcgcggcgg gctttcgcgc gatccgcagg 600 
cagcgcgcga aagtgctttc agaaccggtc cttgcgcgcc cgcgccgcgg tgaagacggc 660 
gagcggcgcg gcatccccgg gcaggccgag cgcggcgcgc aacgcagcgt catcggcgcg 720 
caaaaaggga ttcgttgccc gttcctcgcc caaagtcacc ggcaaactgg gttccccggc 780 
cagccgcaag gccgtcaccc ggtccatccg gtcgtgcagc cgaccgttcc ccggttccag 840 
gctgagcgcg aaccggccgt tcgcggcggt gtattcatgc cccgaacaga cccgggtttc 900 
gggcggcagc gcggccagac gggtcagcgt gtcgaacatc tgcgcggggg tcccctcgaa 960 
gagacgcccg cagccccagc tcatcaggct gtcgccggaa aagagcagcc ccgccccggg 1020 
cagataccag~gcgatatggc cgagcgtatg gccgtcggcc gcgatcacct gcgcggcctc 1080 .~ 
catccccaga tgcagcacgt cgcccggggc caccggatga tcgagcggcg gcagccggtg 1140 
ggcatcggcc gcggcccccg ccaccttggc cccggtcgcc tgcgccagcg cctcgacccc 1200 
cgcgatgtga tcggcgtggt gatgggtgat caggatgtgg tgcagctgcc agcgccggtc 1260 
ggtcagcacc ttcagcaccg gggccgcctc ggggacatcg accaccacca cggtatcggt 1320 
ggcggtgtcg tgccagagcc aggcgtaatt gtcggtcagg caggggatcg gggtcagttc 1380 
gagggtcatg gccttttgcg catctttcgc tatcctgacc cagcttcgcc caaggaaggc 1440 
caacctgcaa tgcatctcga cgtgctcgac ctgcgtgatt tctactaccg cacccaattg 1500 
gggcgcacgg cgcaaaaggc gatccgcgac aaggtggtcg aactctggcc ggacacccag 1560 
tccggcatgg ccgggctgac ggtggcgggc tacggcttcg cggtgccgct gttgcgcccc 1620 
tatctgggcc gggcgcggcg ggtgatcggg ctgatgcccg cgcagcaggg cgtgatgccc 1680 
tggcccgccg gagagcccaa tgtctcggtg ctctgtgccg aaaccagctg gccgctggag 1740 
accgggatga tcgaccggct ggtggtgctg cacgggcttg aagtctccga cgaccccgat 1800 
gcgctgatgg aggaatgctg gcgcacgctg ggccccggcg ggcgggcgct gttcatcgtg 1860 
ccgaaccggg tcgggctttg ggcgccgcgc gaaaccacgc ccttcggctt tggccgcccc 1920 
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tatacgatgg gccagctcga ggcgcaggca cgacgggtgg ggtttgcccc cgaacgtcag 1980 
gcggcggcgc tgtacattcc gccctcgcag cggcggttct ggctgcgctc ctccgagatg 204 0 
tgggaacggc tgggcacaag ggcggcgggc tatctggcgg cgggggtggt gatgcttgag 2100 
gtgatcaagc aggtgcattc ggtgcgccgc tcggggcttg gcgcggcggt gcgcaagccg 2160 
ctctcgatcc ttgaaggggc gcccaagccg gtggtcgggc ggatgtgagc cgcccgcggc 2220 
cgcaagaatc gcccggccgg aaaagcccgt ttccgcggca cttcgccctg cggcggggaa 2280 
acgcagcggg gcgggcttcg accctttgcg ctaacactcc gtgccggtgc agaaaatgtg 2340 
ccagcctgat gcggattcct gccgccaaga tggttgcgag ggtcttgatg ctctgctaga 2400 
cgcaaccccg aatgcggcgt gcgagatcat tttgggcgcc gaggggggcc tctgaatcgg 24 60 
tgacggaacg attggttccg gtgtccgcgt gcggaggcaa aagcatcgga agggtggacg 2520 
.tgtccgaacc agcttcgatt tccgcagcca ttgccgggcg ttatgccacg gccatcttcg 2580 
acctcgcgca ggaggccaag ggcatcgacg cgctctcggc cgacgtggac gcgctgacgg 2640 
ccgccttggc cggttcggcc gagctgcgtg acctgatttc ctcgccggtc tacacccgcg 2700 
aggagcaggg ggacgcgatc gccgcggtgg ctgcgaagat gggcctgtcg gcgccgcttg 2760 
ccaacggtct gaaactgatg gcgacgaagc gccgtctgtt cgcgctgccg cagctgctca 2820 
agggcctggc cgccgcgatc gccgaagcca agggcgagat gaccgcggat gtcacctcgg 2880 
ccaccgcgct gagcgcggcg caggccgaga agctggcggc gacgctggcg aaacagacgg 2940 
gcaagaccgt caaactgaac gtcgccgtcg atgaaagcct catcggtggc atgatcgtca 3000 
agctgggttc gcgcatgatc gacaccacgg tcaaagccaa actcgcttcc cttcagaacg 3060 
ccatgaaaga ggtcggataa atgggcatcc aagcagctga gatttctgcg atcctcaagg 3120 
agcagatcaa gaacttcggg caggatgccc aggtcgccga agtgggccgc gtgctctcgg 3180 
tcggtgacgg gatcgcgcgc gtgcacgggc tcgacaacgt ccaggcgggc gagatggtcg 32 4 0 
aattccccgg cggcatccgc gggatggcgc tgaaccttga agtcgacaac gtcgggatcg 3300 
tgatcttcgg gtcggaccgc gacatcaagg aaggcgacac cgtcaagcgc accaacgcca 3360 
tcgtggacgt tccggcgggc gaaggcctgc tgggccgcgt cgtggacggc cttggcaacc 34 20 
cgatcgacgg caagggcccg atcgtggcga aagagcgtcg catcgccgac gtcaaagccc 3480 
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cgggcatcat tccgcggaaa tcggtgcatg 
acgcgatgat cccgatcggc cgcggccagc 
gcaagaccgc gatcgcgctc gacaccattc 
cgggcaacaa gctgcactgc ttctatgtcg 
agctggtgaa gaagctcgaa gaagccggcg 
ccgcttcgga cccggcgccg atgcagttcc 
aatacttccg cgacaacggc atgcacgcgc 
ccgtggccta tcgtcagatg tcgctgctgc 
cgggcgacgt gttctatctg cactcgcgcc 
atttcggttc gggctcgctg accgcgctgc 
cggccttcat cccgaccaac gtgatctcga 
aactgttcta ccagggcatc cgcccggccg* 
gttcgtcggc ccagaccaac tcgatgaagt 
cgcagtatcg cgaaatggcc gcctttgcgc 
aaaagctgct gaaccgcggt gcccgtctga 
cgctgaccaa cgccgaaatc gtggcggtga 
ccgttccggt gaaggaagtc ggccggttcg 
cccgcaagga cgtgcttgag tggctcacca 
agaagaagct caaagacgcg atcgccgagt 
ggacagggag atgcccagcc ttaaggacct 
tcgcaagatc acgaaagcga tgcagatggt' 
aagcgccgaa gctgcccggc cctatgccga 
cggtgcggtg ggctcgaccg acggtgcgcc 
ggtccatctc ctcgtcatca tgacgggcga 
tatcgcgaaa ctcgcgaagg cgaaggcgat 
gatcctcacc gtcggcaaga aaggtcgcga 
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agccgatggc gaccggcctc aagtcggtcg 3540 
gcgagctgat catcggcgac cgtcagaccg 3600 
tgaaccagaa gtcgtacaac gacgccaacc 3660 
ccatcgggca gaagcgctcg accgtggcgc 3720 
cgatggaata caccaccgtc gtcgccgcga 3780 
ttgcccccta ttcggcgacc gcgatggcgg 3840 
tgatcatcta tgatgacctc tcgaagcaag 3900 
tgcgccgtcc gccggggcgt gaagcctatc 3960 
tgctggaacg ttcggcgaaa ctgaacgagg 4 020 
cggtcatcga aacccagggc ggcgacgtgt 4 080 
tcaccgacgg tcagatcttc ctggaaaccg 414 0 
tgaacaccgg tctctcggtg tcgcgcgtcg 4200 
cggttgccgg tccggtgaaa ctggagcttg 4 2 60 
agttcggttc cgaccttgac gccgcgacgc 4 320 
ccgagctgat gaaacagccg caatattcgc 4 380 
tctttgcggg caccaacggc ttcctcgatg 4 440 
agaaaggcct gctggcctat ctgcgctcga 4 500 
aggaagaccc caagatcaag ggcgacgccg 4 5 60 
tcgccaagac cttcgcttga cggcctgaaa 4 620 
caagaaccgg atcgtgagtg tcaagaacac 4 680 
cgcggcggcg aacattcgcc gcgcccagga 4740 
gcggatgaac gccgtgatgt cgagccttgc 4800 
gcgcctactt gcgggcacgg gctccgacaa 4 8 60 
gcgcgggctt tgcggcggct tcaacgccaa 4 920 
ggaactgctg gcccagggca agacggtgaa 4 980 
cgcgctgcgt cgtgatctgg gccagtatta 5040 
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catcgatcac atcgacctga gcgacgtgaa gaaactgagc tacccggtgg cgcagaagat 5100 
ttcgcaaaac atcatcgacc gcttcgaggc gggcgaatac gatgtggcga cgatcttett 5160 
ctcggtcttc cagagcgtga tcagccaggt gccgaccgcc aagcaggtga tcccggcgca 5220 
gttcgaaacc gatgcggcct cggcctcggc ggtttacgac tacgaaccgg gcgatcagga 5280 
aatcctgacc gcgctgctgc cgcgtgcggt ggccacggcg atctttgccg cgctgctgga 5340 
aaacaacgcg tccttcaacg gggcgcagat gtcggccatg gacaacgcca cccgcaacgc 54 00 
gggtgacatg atcgatcgct tgaccatcga gtataaccgc tcgcgtcagg ccgccatcac 54 60 
caaagagctc atcgaaatca tctcgggcgc cgaggcgctc tgacggaacc ggagatagaa 5520 
gagaatggca agcaaaggca aagtgaccca ggtcatcggc gccgtcgtcg acgtgcagtt 5580 
cgaagacggc ctcccggcga ttctgaacgc ccttgaaacc accaacaacg gcaagcgcct 564 0 
cgttctcgaa gtggcgcagc acctgggcga gaacaccgtc cgcaccatcg cgatggacgc 5700 
gaccgagggt ctcgtgcgcg gcgcggccgt gtccgacacc ggcggcccga tcaccgttcc 57 60 
ggtgggcaac gccaccctgg gccgcatcct gaacgtcatc ggcgagccgg tggacgaacg 5820 
cggtgacgtg tcgaaagccg aagcccgggc gatccaccag cccgcgcccg atttcgcggc 5880 
gcagtcgacg gaaagccaga tcctcgtcac cggcatcaag gtgatcgacc tgctcgcccc 594 0 
ctattccaag ggcggcaaga tcggtctctt cggcggcgcc ggtgtgggca agaccgttct 6000 
gatcatggaa ctgatcaaca acatcgcgaa agtgcactcg ggcttctcgg tgttcgcggg 6060 
cgttggcgaa cggacccgtg agggcaacga cctttaccac gagatgatcg aatcgggcgt 6120 
tatcaacctc gagaagctcg aagaatcgaa agtggcgctg gtctacggcc agatgaacga 6180 
acccccgggg gcccgtgccc gcgtggcgct gaccggcctg accctggcgg aacagttccg 624 0 
cgaccagtcg ggcaccgacg tgctgttctt cgtcgacaac atcttccgct tcacccaggc 6300 
cggttcggaa gtgtcggcgc tccttggccg tatcccctcg gccgtgggct accagccgac 6360 
gctggccacc gacatgggcg cgctgcaaga acgcatcacc tcgaccaaag ccggttcgat 6420 
cacctcggtt caggccatct acgttccggc cgacgacctt accgacccgg ccccggccac 6480 
gtcctttgcc cacctcgacg ccacgaccgt tctgtcgcgt gcgatctzcgg aactcgggat 6540 
ctacccggcc gtcgacccgc tcgactccac ctcgcggatc cttgacccgc aagtcgtcgg 6600 
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cgaagagcac tatcaggtcg cccgtgacgt ccaagggatg ctgcaacgct acaagtcgct 6660 
gcaggacatc atcgccatcc tcggcatgga cgaactgtcg gaagaagaca agctgacggt 6720. 
ggcccgcgcc cggaagatcc agcgcttcct gtcgcagccc ttcgacgtgg cgaaagtctt 6780 
caccggctcg gacggcgtgc aggttccgct cgaagacacc atcaagtcgt tcaaggcggt 6840 
ggttgcgggc gaatacgacc acctgccgga agcggccttc tacatggtcg gcggcatcga 6900 
tgacgtgatc gcgaaagccc agcgcctcgc cgctgcggcg taagggggaa ccatggccga 6960 
taccatgcag ttcgatctcg tgtcgccgga acggcggctt gcctccgttg ccgcgagcga 7020 
ggtccgtctt cccggcgtgg aaggcgatct gacggcgatg ccgggccatg cgcccgtcat 7080 
cctctcgctg cgtcccggca tcctgaccgt ggtcagcgcc gcgggcacgg ccgaatacgc 714 0 
cgtgaccggc ggcttcgccg aggtttcggg cgagaaggtg accgttctgg ccgagcgcgg 7200 
tctgacccgg gcggaactga ccgccgcggt tcatgccgag atgctggccg aggccaagaa 7260 
agtcgcggac gccgcgcatc cgtcggtggc cgatgccgcc gcgaagatgc tggccgacat " 7320 
ggaagcgctt ggctcgcaca tcaatctctg acgggacatc ccgccggata tctcgggccc 7380 
cggtcatcgc gccggggccc ttgctttttg cttttgtctt gccgcgccgc atattagcgt 7440 
gaaggtgcag gcagccggag tgagcgacag gaacggatga agaagttttc ctcgacccgg 7500 
atcggcgtgg cccagggatc gctggtgctg ttttcggatt atctggacgg cggcgtgatg 7560 
tggacgggcg agggcccgcg cgaattgcgc aggctggtgg tgttcgacga agccttccgc 7620 
gagatcccgg cggtgcaggt gtcgctgtcg atgtgggaca tcgaccagaa gcacaatccg 7680 
cgcatggaca tttccgccga catggtgacg gccgagggct tcgtgatcgt ctttcgcacc 7740 
tggggcgaca cccgcgtcgc ccgcgtccgc gcggactggc tggcgatcgg cggctgcgcc 7800 
aatgacgacg actgggacgt ggcctgatcc cgcccggctt gactttccgc ccccccgcgc 7860 
cgatggtgcg cgcgactttc ccatccaacg aggcccgccc gtgcaacaag atgccccccg 7920 
ctggcagctc gtggtgatcc tgtgggggac gaaatatccg gtcgccgaac itcaacgccct 7980 
gatcgagacc gtgtggcccg ggcctcgag 8009 



<210> 35 
<211> 9810 



WO 99/37660 PCT/US99/01313 

-143- 

<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 35 

gatatcgggc ttgtcatttt cgattgcgac ggggttctgg ttgattcgga agttctggcc 60 
gtggccgtcc tcatcgcaga actggaccgg gcgggcgtgc gggtcgacga ggccttcgtg 120 
catcggcatt ttctgggccg gagcttcccg gctgttcagg aggtcgtgca gcgccagttc 180 
ggcgtgaccc tgcccgagac cttccaggtc gaggaacgtg cccggctgct gtcagccttc 240 
gagaccggcc tgcgggccat gctcggggcc gcggagaccg tccgcgcgct gtcggtgccc 300 
tactgcctcg ccacgtcgag cacgccggcc cggctcacgc gctcgctgga gatcacgggc 360 
cttgcggccc tcttcgaggg acgctgcttc accgcgagcc aggtggcgcg cggcaagccc 420 
gcgcccgatc tgttcctgct cgccgcggcc gagatgggcg tcgcgcccga acgctgcctc 480 
gtgatcgagg ataccgagcc cggcgtgcgc gcaggcctcg cggccgggat gcaggtctgg 540 
cgcttcaccg gcggtagcca tttcgcgaac cgatcccccg aggatgcgcc cgatgccctg 600 
ccgcaccggc ggttcgacag cttcgaccgt ttctacgaga ccctgcccgg cctgcgccgg 660 
gccaagtgcg agaccctgac atgatcgacc ggcccgagag cgagccgacg cccctcgacg 720 
atgccgcgcg cgcgggctgg ctctattatg tcgcaggcct gactcaggat cagatcgcgc 780 
gggagctcgg cacctcgcgt cagcgggcgc agcggctggt gagccgggcc atctccgaac 84 0 
ggctgatcca cgtccggctc gagcaccggg tctcgggctg cctgcatctg gaagccgcgc 900 
tcctccggcg cttcgggttg aagctggccc gcgtggcgcc gagtctcggg tccgaggtgg 960 
atcccctgcc ctccatcgcc cccaccgccg ccgccgaggt ggagcgggtg ctgcgctcgg 1020 
agcggccgat ggtggtggcc ttcggcaccg gccggtcgct gcgcgccacc gtcgaggaga 1080 
tgacctcgat ggtctgcgaa cagcacaaga tcgtgtcgct caacggaaat atttctgcgg 114 0 
atggctcggc ctcctactac gatgtgatct tccgcatcgc cgaccgtgtg cgtgcgccgc 1200 
actatccgat gccgatgccg gtcatcgcgc aggatgcggc ggagcgggag ctgtttcatg 1260 
cgctaaagcc cgtgcagtcg gtgctgcggc ttgcgcgcaa tgccgatgtg accttcgtcg 1320 
ggctgggaca gatgggcgag gacgcgccgc tcctgaagga cgggttcatc acgcccgagg 138 0 
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agctgaccga gatgcaggat ctgggcgccg 
cggagggtcg ctacctcgaa accagcatca 
tttccgagga tcggacggtg gtcgccatcg 
acgcaggctt aaggggccgt cttttcaacg 
cacttctgtc ctgaagccgc cgaaaggcgc 
tcggtgagta attattcgcc gcacgaaata 
gatgaccgca agatttcgcg ccctgatggg 
cgccggcgcc gaaaccatca ccgtggcgac 
ggggctcatg tccgagttca acgcgcagca 
cgaggaaaac gtgctgcgcc agaaggtcac 
cgacgtgctg accatcggca cctacgaggt 
gagcctgaac gacctgccgc cggagtatga 
cggcctgacc gtcgacggcg agctctatgc 
catgtatcgc aaggacctga tggagaaggc 
ggacttcgtg aaggaagcgg cgcagaagat 
ctgcctgcgc ggcaaggccg gctggggcga 
cagctacggc gcgcgctggt tcgacgagaa 
gaaggccacg ctgaccgact atctcgacat 
gaaaaacggc ttcaacgaga acctcgcgct 
cgacgcgacg gtggccgcct ccttcgtgac 
ggtgggcttc gcgctcgccc ccgataccgg 
ctggaacctc gcgatcccgg cgggctcgca 
ctgggcgacc tcgaaggact atgccgagct 
gcctccgggg acgcggacgt cgctctacga 
gaagatgacg ctcgacagca tcaacgcggc 
gccttacgtc ggtgtgcagt tcgtggcaat 
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tcggagaggt ggcgggatgg gtcttcgact 1440 
atcagcgggt tgcgggcgtc cgtgtcgaac 1500 
ccggtggcag gcgcaagctc gcggcgctgc 1560 
gcctgatcac cgacgagttc acggcgcagg 1620 
ggcaaaaagt atttgacagg ctggcacccc 1680 
atgctcaccg tgcaggccag ggaggatact 174 0 
cgcgtgcgcc gtggctgcgc tctcgtccgc 1800 
tgtcaacaac ggcgacatga tccgcatgca 1860 
ccccgacatc accgtcgagt gggtgacgct 1920 
gaccgacatc gccaccaagg gcgggcagtt 1980 
tccgatctgg ggcaagcagg gctggctcgt 204 0 
tgccgacgac atcctgcccg cgatccgeaa 2100 
cgcgcccttc tacggcgaga gctcgatgat 2160 
ggggctgacc atgcccgacg cccccacctg 2220. 
gaccgacaag gatgccgagg tctacggcat 2280 
gaacatggcc ttcctcagcg ccatggccaa 234 0 
ctggcagccg cagttcgacg gcgaggcctg 2400 
gatgacgaac tacggcccgc ccggcgcctc 24 60 
gttccagcag ggcaagtgcg gcatgtggat 2520 
caaccccgag gaatccacgg tggccgacaa 2580 
caagggcaag cgggccaact ggctcggggc 2 64 0 
gaaggtcgat gccgccaagc agttcatcgc 2700 
ggtggcctcg aaggaaggct gggccaacgt 27 60 
gaacccggaa tatcagaagg tgccgttcgc 2820 
tgacccgacc cacccggcgg tcgatccggt 2880 
ccccgagttc cagggcatcg gcaccgccgt 2940 
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gggccagcag ttctcggcag ccctcgcggg ctcgatgtcg gccgagcagg cgcttcaggc 3000 
ggcccagcag ttcacgacgc gcgaaatgac ccgcgcgggc tacatcaagt gagcccttcc 3060 
gcgggccggc cetgagcggc cggcccgcac cgcttgccgc ttccggccgt atccgccgga 3120 
ggcctttccg ccccatcagc cccgaggcct ccatggcgac ccagcattca aagactgcgg 3180 
cgcgtctgat gatttccccg gccgtgatcc tcctgttcct gtggatgatc gtgccgctgt 3240 
cgatgacgct ctacttcagc ttcctgcgct acaacctcct catgccgggg atggagagct 3300 
tcaccggctg ggacaattac tattacttcc tgaccgatcc ggccttctcc gcggccctga 3360 
ccaacacgat cctcctcgtg gtcggcgtcc ttctcatcac cgtggtgggc ggggtcctgc 3420 
tcgcgctcct gctcgaccag cccttctggg ggcagggcat cgtgcgcgtg ctggtgatcg 34 80 
ctcccttctt cgtcatgccc accgtctcgg cgctggtctg gaagaacatg ttcatgaacc 3540 
ccgtgaacgg gatgttcgcc catatcgccc gcgggctcgg ccttccgccg ttcgacttcc 3600 
tgtcgcaggc gccgctggcc tcgatcatcg gcatcgtggc ctggcagtgg ctgcccttcg 3660 
ccacgctgat ccttctgacg gcgctccagt cgctcgaccg cgagcagatg gaggcggccg 3720 
agatggacgg cgcctcggcg ctcgaccggt tcatccacat caccgtgccg cacctgacgc 378 0 
gtgccatcac cgtggtggtg ctgatccaga ccatcttcct tctgggcgtc ttcgccgaga 384 0 
tcctcgtcac gacgaacggt ggacccggca ccgcctcgac caacatcacc tacctcgtct 3900 
atgcgcagtc gctcctgaat tacgacgtgg ggggcgggtc ggccggcggc atcgtcgccg 3960 
tggtgctcgc caatatcgtg gcgatcttcc tgatgcgcat gatcggcaag aatctggacg 4 020 
cctgacatgt cacgccgcac ctcaacccgc cgcacgctga tcgtcacgct cgccgcctgg 4 08 0 
acgatagcct tcctcatctt cttcccgatc ctctggacgg tgctgatgag cttcaaatcg 414 0 
gaaggagacg ccatcaaggc gcccttcgcc atgctcttct cggactggac cctgcaatcc 4200 
tacgccgatg tgcaggaacg gtcgaactac gcccgccact tcatgaattc ggtggtgatc 4260 
tcgctgggct cgaccctcgt ggcgctcgcc atcgcgatcc ccgccgcctg ggccatggcc 4 320 
ttcgtgccgg gccggcggac gaaggacgtg ctgatgtgga tgctgtcgac caagatgatg 4380 
ccggcggtgg gcgtgctcat cccgctctat ctgatcttcc gcgacacggg ccttctcgac 4440 
acgcggatcg gcctcgtgat cgtgctcacg ctcatcaacc tgccgatcgt ggtctggatg 4500 



WO 99/37660 PCT/US99/01313 

-146- 

ctctacacct acttcaagga gatcccgggc gagatcctcg aggcggcgcg gatggacggg 4560 
gcgacgctcg gctccgagat cctctatatc ctcacgccga tggccgtgcc gggcatcgcc 4620 
tcgacgctgc ttctgaacgt gatcctcgcc tggaacgagg ccttctggac gctgcagctg 4 680 
accacctcgc gggcggcccc gctcacgcag ttcatcgcga gctattccag ccccgagggc 4740 
ctcttctacg ccaaactgtc ggcggcctcg accatggcca tcgcgccgat cctgatcctt 4 800 
ggctggttca gccagaaaca actcgtccgc ggcctgacct tcggcgcggt gaagtgagga 4 860 
ccacatgggc aagataaccc tgcgcaacgt ccagaagcgg ttcggtgagg cggtcgtcat 4 920 
cccctcgctc gacctcgaca tcgaggatgg cgagttcgtc gtcttcgtcg gcccctcggg 4 980 
ctgcggcaaa tccacgctcc tgcgcctgat cgcgggcctc gaggatgtgt cggacggcca 504 0 
gatcatgatc gacgggcgcg acgccaccga gatgccgccc gcgaagcgcg gcctcgccat 5100 
ggtgtttcag agctacgcgc tctatccgca catgacggtg aagaagaaca tcgccttccc 5160 
gctgcggatg gcgaagatgg agccacagga gatcgagcgg cgcgtgtcga acgcggccaa 5220~ 
gatcctgaac ctcaccaact atctcgaccg ccgccccggc cagctctcgg gcgggcaacg 5280 
gcagcgggtg gccatcgggc gcgccatcgt gcgcgagccg gcggccttcc tgttcgacga 534 0 
gccgctctcg aacctcgatg cggcgctgcg ggtcaacatg cggctcgaga tcaccgagct 54 00 
gcaccagtcg ctcgagacca cgatgatcta tgtcacccac gatcaggtcg aggccatgac 54 60 
catggccgac aagatcgtgg tgctgaacgc gggccggatc gagcaggtgg gctcgcccct 5520 
caccctctac cgcaatccgg cgaacctctt cgtggcgggc ttcatcggca gcccgaagat 558 0 
gaacctgatc gaggggcccg aggccgccaa gcacggcgcc accaccatcg ggatccgccc 564 0 
cgaacatatc gacctgtcgc gcgaggcggg ggcgtgggag ggcgaggtcg gcgtctcgga 5700 
acatctcggc. tcggacacgt tcctgcatgt igcatgtcgcg gggatgccca ccctcaccgt 5760 
gcggacgggc ggagagttcg gcgtccatca cggcgaccgg gtctggctca cgccgcaggc 5820 
cgacaagatc caccgcttcg gcgccgacgg aaaggcgctc tgacatgcgg ctcgacggca 5880 
agaccgccct catcaccggc tcggcgcgcg gcataggccg cgccttcgcc gaggcctatg 594 0 
tgcgtgaagg cgcgcgcgtg gccatcgccg acatcaacct cgaggcagcc cgcgccaccg 6000 
cggccgagat cggccccgcg gcctgcgcca tcgccctcga cgtgaccgat caggccagca 6060 
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tcgaccgctg cgtggccgag cttctcgacc gctggggcag catcgacatc ctcgtgaaca 6120 

atgcggccct cttcgatctg gcgcccatcg tcgagatcac ccgcgagagc tacgaccggc 6180 

tgttcgcgat caacgtctcg ggcacgctct tcatgatgca ggcggtggca cgggcgatga 6240 

tcgcgggcgg ccggggcggc aagatcatca acatggcaag ccaggccggc cgccgcggcg 6300 

aggcgctggt gggcgtctat tgcgcgacca aggccgccgt catctcgctc acccagagcg 6360 

cggggctgaa cctcatccgc cacgggatca acgtcaatgc catcgccccg ggcgtggtgg 6420 

acggcgagca ctgggacggg gtggatgcga agttcgccga ctacgagaac ctgccccgcg 64 80 

gcgagaagaa gcgtcaggtc ggcgcggcgg tgcccttcgg ccgcatgggc cgcgccgagg 654 0 

acctgaccgg catggcgatc ttcctcgcca cgcccgaggc cgactacatc gtggcccaga 6600 

cctacaacgt ggacggcggc aactggatga gctgaggccc aaggcccggc cctccccccg 6660 

tcgaacgcgc cccctatccg aggtaatccc atgacccgct ccgtcacccg tccctcctat 6720 

gaccgcaagg cgctcactcc cggcatcgtc catatcggcg tcggcaactt ccaccgggcg 6780 

catcaggcgg tctatctcga cgatctcttc gcgctgggcg agggccacga ctgggccatc 684 0 

ctcggcgcgg gcgtccgccc gaccgatgcg cggatgcgcg aggctctggc cgcgcaggac 6900 

aatctctcga cggtgatcga gctcgatccg gcgggccacc gggcccggca ggtgggggcg 6960 

atggtgggct tcctgccggt cgaggccgac aatgcggccc tgatcgaggc catgtcggat 7020 

ccgcgcatcc gcatcgtctc gctgaccgtg accgagggcg gctattatgt cgatgcctcg 7080 

ggcgccttcg atccgacgca tcccgatatc gtggccgatg cggcccatcc tgcgcggccc 714 0 

gcgaccgcct tcggcgcgat cctcgccgcc ctccgcgccc gccgcgacgc gggggttaca 7200 

cccttcaccg tgatgtcctg cgacaacctc cccggcaacg gccatgtcac ccgcaacgcc 7260 

gtggtgggcc tggccgagct ctacgacgcc gagcttgcgg gctgggtgaa ggcgcaggtg 7 320 

gccttcccga acggcatggt cgaccgcatc acccccgcca ccggcccgca cgagcgcgaa 7380 

ctggcgcagg gcttcggcct cgccgatccg gtgcccgtca cctgcgagcc gttccggcag 74 4 0 

tgggtgatcg aggatcattt ccccgccgga cgccccgcgc tcgagaaggt gggcgtgacc 7500 

ttcaccccgc atgtccatgc ctacgaggcg atgaagatcc gcatcctgaa cgggggccat 7560 

gcggtgatcg cctatccgtc ggcgctcatg gacatccagc tcgtgcacgc ggccatggcc 7620 
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catccgctga tcgcggcctt cctgcacaag gtcgaggtcg aggagatcct gccccatgtc 7 680 
ccgcccgtgc ccgacaccag catccccgac tatcttaccc tgatcgagag ccgcttctcg 7740 
aaccccgaga tcgccgacac gacgcgcagg ctctgcctcg acggttcgaa ccggcagccg 7800 
aagttcatcg tgccgtcgct gcgcgacaat ctggcggcgg gcacggtgcc gaaggggctg 78 60 
gtgctgctct cggcgctctg gtgccgctac tgcttcggca cgacggactc gggcgttgtg 7 920 
gtcgagccga acgatccgaa ctggacggcg ctgcaggacc gggcgcggcg ggcgaaggag 7 980 
acgccggccg agtggctggc gatgaccgaa gtctacggcg atctggcgca gaacgatctt 804 0 
ctggcggccg agttcgcggc agccctcgag gcggtctggc gcgacggggc cgaggcggtg 8100 
ctgcggcgct tcctcgcggc ctgatccgca gggcccagcc gctcggagca ccgaagcgga 8160 
gcccctgccc cttgcggcgc accgtgaggc gaaacgaccg ggccaccccg gggccaccgc 8220 
ctcggtaaca ccatggtatc gcgcaagaat gccggcgcct ctgccgaacg ggcccggctg 8280 
ccgggcgagg cgccggactt gtcaaggcgg. cggccctcgg gtagagaggg cgggcgtggc- 834 0 
cccgttagca cagtggtagt gcagcgctct tgtaaagcga aggtcgttcg ttcaaatcgg 84 00 
acacggggca cgcgatcctc cctccgcatc ggcgctcgcc cccggtctgg actgcctctt 84 60 
cggaaggcac ctgcccgctt gtgcgccgcg ccctttcctc gcttcccaag cgtctgtcac 8520 
ggcttgcgga aagccgtgcg cctcggttct ggacagccgc cccttgcggt gtaatctgcc 8580 
ctcagcgcgc agccggcgga cagaagccgg cccgccacgt ccacaaggga ggaatgccat 8 64 0 
ggatcgtcgt tcattcatca ccaaggccgc cgtgggaggg gccgccgcga gcgccctcgc 8700 
cgcgccggcg cttgcccagt ccgcgcccaa ggtcacctgg aggctcgcct cctccttccc 8760 
gaaatcgctc gacacgatct tcggcggcgc cgaagtgctg tcgaagatgc tctccgaggc 8820 
caccgacggc aacttccaga tccaggtctt ctcggcgggc gagctggtgc cgggcctgca 8880 
ggccgccgac gccgtgaccg agggcaccgt cgaatgctgc cacacggtcg gctactatta 894 0 
ctggggcaag gatcccacat tcgcgctggc cgcggccgtg cccttctcgc tgtcggcgcg 9000 
cggcatcaac gcctggcact accatggcgg cgggatcgac ctctacaacg atttcctcgc 9060 
gcagcacaac atcgtggcct tcccgggcgg caacaccggc gtgcagatgg gcggctggtt 9120 
ccggcgcgag atcaacaccg tggccgacat gcagggcctg aagatgcggg tcggcggctt 9180 
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tgcggggaag gtgatggagc gtctgggcgt cgtgccgcag cagatcgcgg gcggcgacat 9240 

ctatccggcg ctggagaagg ggacgatcga cgcgaccgaa tgggtcggcc cctatgacga 9300 

cgagaagctc ggcttcttca aggtggcgcc ctactactac tatcccggct ggtgggaagg 9360 

cggcccgacc gtccatttca tgttcaacaa gagcgcctac gaggggctga ccccggccta 9420 

tcagtcgctg ctgcgcaccg cctgccacgc ggccgatgcg aacatgctcc agctctacga 94 80 

ctggaagaac ccgacggcga tcaagtcgct ggtggcgcag ggaacccagc tcaggccctt 954 0 

cagccccgag atcctgcagg cctgtttcga ggccgcgaac gaggtctatg ccgagatgga 9600 

agcctcgaac cccgccttca agaagatctg ggactcgatc aaggccttcc gctccgagca 9660 

ctacacctgg gcgcagatcg ccgaatacaa ctacgacacc ttcatgatgg tgcagcagaa 9720 

cgccggcaag ctctgagccc gagcgccgcg cgaaagagga ccccggagcc gcgttccggg 9780 

gtcttttcat gggcgacagg ggccggcgcg 9810 

<210> 36 
<211> 1886 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism : Unknown 
<400> 36 

tgagtgtcta ttttttttcg ggttttttt a agtgtgaatc acatggttag gagcagttgt 60 

cttcaatgtg accaaccatc ccaaggctct aattcaacgt ttgggtgtgg gggcccgctg 120 

gcagctgtgt gtgccactgg gctgttggtg ttggtgcttt actccccctc atcgcaaacg 180 

gctaattggt cggcacaggg tatttccaca aaggcgctgt atccggcagt gcctgtgcct 240 

tccactctgc tgcctggaag cgcgcctgcc aaacaccagc tgcatgtttg gagggcacat 300 

gcgatgtcgg aggccacaac aaacaattca ttcaaacagt cattatttgg gtacaatgcc 360 

atctcctcca tttggcttca actggctggt gtggccgcca ctttctttgc atttggagct 420 

ttgatggcag ctgtaacgca acgcaaggag atcgccgtct tctccgcctc gggtcaggct 480 

gctgagccgg agggggcgga gcccctgaag cggccttttc cgtctcctgc tgccaaacct 540 

aagccgctct tctccacccc ggcaaattcc ttcagcaaca tcttccaggc gcctccatcg 600 
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ctgcgcacgg actccaccta tggccgaggc ccgcgctcga ccagcttcac cgacatcagc 660 
aactggccct ccaacaacgc actccgcaac ccccagtcgg tgattgacat cgggggagga 720 
gtcgacttcc tgggggacag aagccctgga aacccgttca cgcggctgcg ggggtccccg 780 
agctccaccc tcagcaacct cggcatgggc ctaggcctgg ggctgggcaa gggcaagggc 84 0 
ttcggcaagg gcttcggcaa aggccggggg ttccccgtgg aggaggaggt ggaggaggag 900 
caggaggtgc tgtcgtgggc cgaccgccgg cgggcgctgg cggaccccga cgccccgccg 960 
atgaacgagg acatcaagta cccgcagctg cggctggtgc gggccgtgcc gggcggccgg 1020 
gacgagaagc tcggtgtgat gtcgaggcag gaggcgctgg agctggcgga ggcggaagac 1080 
atcgacctcg tcctcgtcag catcgacacc gaccccccgg tggccaagct agtcaattac 114 0 
tcgaagttga agtacgagtc cgagaagaag aagaaggaca gccacaagaa ggggaaggtg 1200 
aaggaggtga aggagctgaa ggtgtcccat aagatcggcc agcacgacta cgacgtccgc 1260 
-gtgaagcagg cccgaaagtt cctggagggc ggccaccgca tcaaggtgtc gatggagtftc 1320 ~~ 
aaggggcgcg agaaccagtt cgtggagatc ggccgcgcgg tgatgaagcg cttccagaac 1380 
gacctggcgg acatgggcaa ggcggacgcc gtgcccaaga agctcggcac ccggctgatc 14 4 0 
ctgaacctgg ccccggccgg ggaggcgctg aaggtgattg cggagcggag ggcagagcgc 1500 
gacaggaaag ccgcggctga ggaggagggg gagggcgacg acctcgactt cgtggacgag 1560 
aacgaggacg aggatgtgga gggggagggc gaggaggaag aggccgagga gctggaggag 1620 
gagacagcgg aggggacgga ggtgccaacc cgcagctgat cgccgatccg cgggggacag 1680 
ccacctcccc cccggcctcc ctgccggggg ccggcaccat ccgtcgttgc ggtgcggcgc 174 0 
tgccatcaac ggccgtcctt gagcttaatg ctcccgccct ccgttggccc gcggcggtcg 1800 
ccaggttgct ggcctggctg cccgcagctc ctcccctccc cgactgacac agtgtggatg 1860 
accgtgatgt gcgccttttc gccttc 1886 

<210> 37 
<211> 3015 
<212> DNA 
<213> Unknown 

<220> 
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<223> Description of Unknown Organism: Unknown 
<400> 37 

ccgctcatct ccaggcctcc ctgagtgcgt acccgagagc ggcaagtaga gaaaggaaca 60 

cagatacagc accatggcct ctaggctcgt ccgtgtgctg gcggccgcca tgctggttgc 120 

agcggccgtg tcggtcgacg cgcgcttcgt ggtgcgcatg gtgcaggtgg tacaccgcca 180 

cggtgcgcgc agcgcactca tcgacgacaa cacgacggag atttgtggca ccctgtaccc 240 

gtgcggtgag ctgaccggcg agggtgtcga gatggtccgt gctatcggcg agtttgcccg 300 

cagccgctac aacaacctct cattggtgga gagccctctc ttcccgtcga cgcggtacaa 360 

ctcctctgtc gtgcacacac gctccaccca cacccagcgc accatccaga gcgcgaccgc 420 

ctttctgcgc ggcctcttcc aggacgacta cttctacccg gtggtgtact cgaccaacag 480 

aacgaccgaa acgctgctca gcactgacgc ggtgccgtcc gtggtgggcc gtagctggct 54 0 

cgacaacccg gcgctgcacg ccgccctcaa cccggtgatc gatgagcacc tcagctggga 600 



cgccatccag agcgctgcca aggacgcatg ggtcgagggc ctgtgcgcgg actacaacgc 660 
ccgcaccaac tgcgtcctcg acatgtacga cgtggccgcc gccttcgagg ccgccgggcg 720 
tcttgacaat gccaccaatc tcaaggcggt gtatcccggc cttcaggagg tgaacgccgc 780 
ctggttcaag tatgtcttca gctggaacca cacgagcaag ctcgatctca cgcagggctc 840 
cgcctcgcag aaccttgcgc agacggtgct ggccaacatc aacgcccacc gcctctctcc 900 
gtcgtacaac atgttccagt acagcgctca cgacacaacg gtgactccct tggctgtcac 960 
gttcggtgac cagggcgaga cgacgatgcg tccgcccttc gcggttacca tcttcgtgga 1020 
gctgctccag gacaccgcag atgccagtgg ctggtacgtg cgcctcatcc gcggcaaccc 1080 
tgtgaaggca gccgacggca cctatgtctt ccaggagtct ggtatcaagg catactgcat 1140 
cgatgaagcc gggaacaagt acctcgcaca caccggcatc tgcccgctga atagcttccg 1200 
ccgcatggtc gactactcgc gccccgccgt ggctgacggt cactgcgcca tgacacagac 1260 
tcagtacagc aacatggatt gcccgcgcac tatcgcggac aacaagccgg tgccgtcgcg 1320 
ctgctggctc taccgccacg tttgccctag caaggcatgc ccggacagct acattctctc 1380 
cgcggtcgac caccagtgct accccgggcc cgacgttacg aaccccacca gcagcagcag 14 4 0 
cagcgagggt accaccacca gcagcagcga gggtaccgcc accagcagca gcgacgttac 1500 
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caccaccagc agcagcgagg gtaccgccac 
cgagggtacc gccaccagca gcagcgacgc 
caccagcagc agcgagggta ccaccagcag 
cgctaccacc accagtagca gcgagggtac 
cagcagcgag ggtaccgcca ccagcagcag 
cgccaccagc agcagcgacg ctaccaccac 
cagcgacgct accaccagca gcagcgaggg 
cagcagcagc agcgagggta ccaccagcag 
cgttaccacc accagcagca gcagcgaggg 
cagcagcagc gagggtaccg ccaccaccag 
gggtaccacc agcagcagca gcgacgctac 
"cagcagcgac gctaccacca gcagcagcag 
taccaccagc agcagcgagg gtaccgccac 
cagcgagggt accaccagca gcagcagcga 
caccaccagc agcgacgcta ccaccagcag 
cgacgctacc accaccagca gcgacgttac 
caccagcagc agcgacgcta ccaccaccag 
cgagggtacc accaccagca gcagcagcag 
cccttccttc aaaaagcccg cgaactggag 
ccacattgcc ggggacatca tccgccgcgt 
ccgaaagcac gatgagtaca gccggcaccg 
ttgtgcatgt ttgaatggag aggaggcttc 
cacaaccgct cgtttcaccc ccgtcatctt 
ctttgtgggc tctctctggg tgtgggcgct 
ccatcgctct tacctgtggg ctattttttt 
cgcgtccctc tgcag 
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cagcagcagc gacgctacca ccagcagcag 1560 
taccaccagc agcagcagcg acgctaccac 1620 
cagcagcgac gctaccacca gcagcagcga 1680 
cgccaccagc agcagcgacg ctaccaccac 174 0 
cgacgttacc accaccagca gcgagggtac 1800 
cagcagcagc gagggtacca ccagcagcag 18 60 
taccgccacc accagcagcg acgctaccac 1920 
cagcagcgac gctaccacca gcagcagcga 1980 
taccgccacc agcagcagcg acgctaccac 204 0 
cagcgacgct accaccagca gcagcagcga 2100 
caccagcagc agcgagggta ccgccaccac 2160 
cgagggtacc accagcagca gaaigtgacgc 2220 
caccagcagc gacgctacca ccagcagcag 228 0 
cgctaccacc agcagcagcg agggtaccgc 234 0 
cagcagcgag ggtaccacca gcagcagcag 2400 
caccaccagc agcagcagcg agggtaccgc 24 60 
cagcgacgtt accaccacca gcagcagcag 2520 
cagcagcaaa agcacaagtt catcggatgt 2580 
cccgcgcgtt ctctcgccgg aaaggggccg 264 0 
gacgaacggt gttacgatcg gtgcgggtgt 2700 
ccaacagtag cacaacggca tgtaactctt 2760 
tgtacagcgt acattgtttc gagaaggtat 2820 
ttcattttga tctccgtcgt ctcatactgc 2880 
tgtgcgtgtg tcgctgtaaa gtcgttgacg 2940 
aattatggtt tattattact tccctctctg 3000 

3015 
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<210> 38 
<211> 38186 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 38 
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tagctggtcc ctaaagggaa cgtggtgtaa ccaccacaca gaacgaagct ggaacgttcc 1320 
tgccgtcctt agaagctgcc tttgctaagg ggaattgccc tgacttccca caccattgat 1380 
tcatctccag acccttggtt ttcatgttga tttttcaaaa atcacctgat agtctgaccg 1440 
aatgtagctt tccactggtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgagagagag 1500 
atggagtctc gctctgtcac ccgggctcca gtgcagttgt gtgatcttgg ttcactgtaa 1560 
cctcctcctc ccgggttcaa gagactcgtg cctcagcctc ccgagtagct gggattacag 1620 
gcacccgcca ccacacccag ctaatttttt gtatttttag tagagatggg gtttcaccat 1680 
gttggccagg ctggtctcga actcctgaca tcaggcgatc cacccacctt ggcctcccag 1740 
agtgctggga ttacaggtgt gagccaccac gcccggcctt atttttcccc cattttcttt 1800 
tttttttttt ttgagtcagg gtcttgttct gcgctcaggc tggagggcag tggtgtgggg 1860 
atcacggctc actgcagcct cgacttcctg caccaccacg cctggctgtt tttttttttt 1920 
ccggtagaga cgggggtctt accgtgttgc" ccaggctggt ctagaactcc tgggctcaag 198 0 
cgatcctccc gcctcggcct ccgcaaatgc tgagatcaca cgcgtgagcc cccgcacccg 204 0 
gcctcctttc caccgctctt gtctacagcc gcccctcctg gtccgattgt attggcagat 2100 
gtcgccaata cggtgtcaaa cggcgaaggg gcactgagcg ttttttcttt ctcccgtcct 2160 
tggcggcagc agctcggttc cggctacggg gctgagcccg tctctcagac gaggaaactg 2220 
gggtccgaga ggtgagccgg tcccagaggc agggcgaggg ggaagcggga gtggggtccg 2280 
cagcggaccc agccctgcct cccccctgca ggagatcgtc aacttcaact gccggaagct 2340 
ggtggcctcc atgccgctgt tcgccaacgc cgaccccaac ttcgtcacgg ccatgctgac 24 00 
caagctcaag ttcgaggtct tccagccggg tgactacatc atccgcgaag gcaccatcgg 24 60 
gaagaagatg tacttcatcc agcacggcgt ggtcagcgtg ctcactaagg gcaacaagga 2520 
gatgaagctg tccgatggct cctacttcgg gggtgagctt gaggggggcg cgcctggagg 2580 
gggagggggc acgcgacccc cgcggtgtgc agagccaggg ggccggggcc ggggccgggg 2 64 0 
ccggggatgg ggatggggat ggggatgggg ccggggatgg ggatggggat ggggatgggg 2700 
ccggggatgg ggatggggat ggggccgggg atggggatgg ggccggggat ggggccgggg 27 60 
atggggccgg ggatggggcc ggggccggca ccagggagag cctgggtggg aagcgcccac 2820 
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gctggccaag gtgcagaggc cgggccgtgt gcctgggcgg ggagggccgc ggcgcccgcc 2880 

tcgtccagca acccccccct gcgcgccacg tgcagagatc tgcctgctca cccggggccg 2940 

ccgcacggcg agcgtgcggg ctgacaccta ctgccgcctc tattcgctga gcgtggacaa 3000 

cttcaacgag gtgctggagg agtaccccat gatgcggcgc gccttcgaga cggtggccat 3060 

cgaccgcctg gaccgcatcg gtgagcgggc cgggggcgtg gccggggcgg gtgccctggc 3120 

gggggagggg cgtggccaag gcatcaggag agtggcttgg acagtggcag ggggaagggc 3180 

gtggctgtgg catcaggggc acggttgggg cagagacgtg gccaaggcat caggagtgtg 3240 

gccatggcag caggggcgtg gctggggcag gggcagcggc tggccgctcc taggacccct 3300 

ttgggtctag aggctgattt tctgacctat tgtcctactt cagccagagg cagcctgttt 3360 

cccaagggag ggaatgcaca gggtgtttgc ggttgtgccg aatgctcggt gagcacctgc 3420 

tgtgtgctgg gggtgcaggg gacagacccg ggggcccact cagactccca gggaggctta 34,80 

tggactggtg atgaaatcac acacgactgg gctgtgtgcc agcagggcag gtggggccgg 3540 

tgggcttccc tgagttggga atgcagagtg gagaccaggg taagggatgc catgtggaaa 3600 

cggggaggaa gatgtgttcg tggagtggac acagcacatc ccaaggccct gaggtggaaa 3660 

agaggcctag agtccagaga gccagggagg cctggaggag gttggggaag aaggggaggc 3720 

cagacacaca gggcccagtg ggcggcaggg agagtttaga ctaaatcagg agcatcaggg 3780 

agccatggag ggttctaggt gggcggagga cctggtcaga ttgtatccgc caaggcgggc 3840 

cgtgtccagg agggagacgg tgacctggcc tctcaggggg gcagtctctg gggcagggag 3900 

cggcagagcc ctgatgactg gatgtaggcg ccagagagat ggcggctcat gctgctgttc 3960 

gtgggaatgg gaatgaagac catggctgaa acgcaggaca ggtgcgacgg agtggtgtca 4020 

gggagctccc tggtgtacag taggaagctc tccacaactt gctctataca gtgagtatgc 4080 

aacccgttcc tgagtatcag gtgcttaggt tataacttct gtatacagca ggtgctcagc 4140 

acaggctgtg tacaggcagg tgttttcggt atgcctgtgg cacactggag gcagtcatta 4200 

cataatcagc gtatacaggt ggtacacatg catacttggt gcacagtgat acctgctcca 4260 

tgtacacagc aggcattaaa tacctgttta ctgccaggcg cggtggctca cgcctgtagt 4320 

cccagcactt tcggaggcca aggtgggtgg atcacgaggt caggagattg agaccatcct 4380 
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ggctaacatg gtgaaacccc gtctctacta aaaaaaaaat acaaaaaatt agccgggtgt 4440 
ggtggcgggc gcctgtagtc ccagctactc gggaggatga ggcaggagaa tggtgtgaac 4 500 
ccgggaggtg gaccttgcag tgggccgaga tcgcgccact gcactccagc ccgggcgaca 4 560 
gagcaagact ccgtctcaga aacaaagcaa aacaaaagcc ctgctttctg tatgcaggtg 4 620 
cttcatgcat gctggctgtg catagcaggt gctcagcctg tatatggcag gtactcaata 4 680 
tccatactat aggccagaga tgctacatat gtgcttattg tatacagtag gtggtaaatg 4740 
catgcttgct ctacacggca agcactgtgt gcgcacccgc ggtgcagagt aggtgctcgg 4 800 
tgcccgctgt acgcagcagg cgctccctgt gcacacgcta acgccccctc tcccgcaggc 4 860 
aagaagaatt ccatcctcct gcacaaggtg cagcatgacc tcaactcggg cgtattcaac 4 920 
aaccaggaga acgccatcat ccaggagatc gtcaagtacg accgcgagat ggtgcagcag 4 980 
gccgagctgg gtcagcgcgt gggcctcttc ccgccgccgc cgccgccgcc gcaggtcacc 504 0 
tcggccatcg ccacgctgca gcaggcggcg gccatgagct tctgcccgca ggtggcgcgg 5100 
ccgctcgtgg ggccgctggc gctcggctcg ccgcgcctcg tgcgccgccc gcccccgggg 5160 
cccgcacctg ccgccgcctc acccgggccc ccgccccccg ccagcccccc gggcgcgccc 5220 
gccagccccc gggcaccgcg gacctcgccc tacggcggcc tgcccgccgc cccccttgct 5280 
gggcccgccc tgcccgcgcg ccgcctgagc cgcgcgtcgc gcccactgtc cgcctcgcag 534 0 
ccctcgctgc ctcacggcgc ccccggcccc gcggcctcca cacgcccggc cagcagctcc 54 00 
acaccgcgct tggggcccac gcccgctgcc cgggccgccg cgcccagccc ggaccgcagg 54 60 
gactcggcct cacccggcgc cgccggcggc ctggaccccc aggactccgc gcgctcgcgc 5520 
ctctcgtcca acttgtgacc ctcgccgacc gccccgcggg cccaggcggg ccgggggcgg 5580 
ggccgtcatc cagaccaaag ccatgccatt gcgctgcccc ggccgccagt ccgcccagaa 564 0 
gccatagacg agacgtaggt agccgtagtt ggacggacgg gcagggccgg cggggcagcc 5700 
ccctccgcgc ccccggccgt cccccctcat cgccccgcgc ccacccccat cgcccctgcc 57 60 
cccggcggcg gcctcgcgtg cgagggggct cccttcacct cggtgcctca gttcccccag 5820 
ctgtaagaca gggacggggc ggcccagtgg ctgagaggag ccggctgtgg agccccgccc 5880 
gccccccacc ctctaggtgg cccccgtccg aggaggatcg ttttctaagt gcaatacttg 5940 
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gcccgccggc ttcccgctgc ccccatcgcg 
gcgcgcgtcc cccggtgacc tcggggagca 
agaggcaggc ctggctgcgc agggcgcggg 
atgtactgac gagccgaggc agcagtgccc 
ccacaccccc attccgcgca ataaacgaca 
tgcccgagac ccgcagggcg tgcacccttc 
agagctttat ttacacactg acaaggctca 
gcttcacctg ctccaggtcg aaggcccctg 
cgggaaagcc ccgccctggc ccgcagttcg 
acctggcttg ggcaccgcct gcagtgtctc 
gggcctgggg ttggaagcag ggtggggtga 
aatccaggta gttggggtca gggagcgcct 
ctggacaggt cctgcaggat gggctcgctg 
tgggcaggag tcagaggatc cccaggggtg 
caacgcccca gtgtgggggc cccacccatg 
ctggttcatg acggagacat cagctgcgtg 
ggtcaggccc ttcctgtggc agagcggagg 
acccagtgac cagcatcctg gccctgcgct 
ccaccctcct gcaggcctcg ccccacccct 
aggtatcagt acagggggag gaaatgttcc 
ccccacgctg ctgtgggagc ctcagctccg 
gccaccaccc cgcatcctga gcattcccag 
acctccacgt gctccagatc taaccacaca 
gggcagcgcc catgctcggc tctccctgtc 
ctacattcgg ggcacacagc ctcagggcct 
actctgccca gtccctgccc ccagctagac 
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ctcacgcaat aaccggcccg gcccccgtcc 6000 
gcaccccgcc tccctccagc actggcaccg 6060 
ggggaggctg gggtcccgcc gccgtgatga 6120 
ccaccgtggc cccccacgcc ccattaaccc 6180 
gcattggcgc caagcctggc cgcgtgtgat 624 0 
ctgaagacag tggctcctgg gggtggcaaa 6300 
cggggtgtca gctgaagaag taggtggaac 6360 
cggaggaagc agagcggacg gcgtgggtgg 6420 
agccaccctt gcgaggctgc ccacccgcct 64 80 
cttcagctgg ctggcctcca agatcttctg 6540 
ggctgaggcc aggttttggg gtgggggggg 6600 
tactcagagc agaaccgctt gaccaggaat 6660 
tgcaagcgga caaactgctc ccggcacacc 6720 
atcaggcagg ctctgggcac cacccctacc 6780 
ggtggactga ggctcagact acgggggcac 6840 
agtccagtaa cagtcgtgca cagagacgaa 6900 
actcctgaag ggaggggagc tcacagggcc 6960 
cagccccctc cacttgaggt ccagggaagc 7020 
cgccccgccc ctccccaaac atcctgggtt 7080 
cagaagcctc ctcgccccac ccctgccgcc 7140 
agggcggcta cgaggtcccc tcctgccagg 7200 
ctcccgtggc cggtagattc tgctggaacg 7260 
tcgcggtgcc aagaaatgcc cagcaggaag 7 320 
gggccacagg aggggagctg ccaggaccac 7 380 
ctacacaggc cccacagaca cagcagatcc 7 4 40 
ccagccttgc cagctgtgcc ctgctagcca 7500 
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gaagacgccc 
tgcagggcgg 
ccgttcttct 
gcccagggac 
caggccccag 
gctctgaatt 
gcccagcacg 
ggtggcacac 
tacgctgggg 
ggatggggtg 
acgcccaggg 
tcggtcagcc 
cccgtccact 
gcccgggtcc 
gaggcctccc 
gcccgttcac 
agcagggggg 
ctaacaggca 
gggcacaccc 
ctggcgcacc 
gccatagcgc 
gatgaaacct 
cacctccacc 
ggccaccaag 
ggtg.agggca 
gggcagcagg 



ctgggaggcg 
tgagcatcat 
gcttacgtgt 
acccctaact 
cccgggcccc 
ccacctccta 
gtggtggtac 
tggcgcggga 
cactggtaca 
gtacactgac 
gtgtgaccca 
agtgctgtgg 
tgcttaggga 
ccgagaacat 
acacgaactc 
gccctactcc 
caggggaatg 
gccagtggtc 
gtctgagttt 
tggggaaagt 
gtgaccccgt 
tccagcacct 
tgcacggcgg 
cacccatgaa 
cctggggccg 
gcacacccta 



agcggcaccc 
gtgggaggag 
gttgggcttt 
ggccgctgtc 
ccactcaccg 
tttgctaaaa 
attgaggtgg 
tggggtggca 
ctgggacgct 
cttgaccttg 
ctccaccaca 
gacacaggcc 
gtcctggccg 
ctcctgtaga 
ctgcagaggg 
cccctatttc 
gggcctggcg 
tgggggagca 
taaatggcag 
cgctcagctc 
acaccaccgt 
gtgccacccg 
gtgggccggg 
gcccccgccc 
agactcaggg 
cctgcgcggc 
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acgccgtccg 
tccagcgagt 
ctgaggacgg 
tccaccgtgg 
gctgatgtct 
aggggaaggg 
tggtacactg 
cactggggcg 
gttacactgg 
gagtccaggc 
gagcccatgt 
gtctcagggc 
agcggggaca 
ctcttgaaga 
cgggcagcag 
agagccactg 
cccacgcagt 
gccagggctc 
tgaaaccaac 
ccggaggcgc 
catcaccgtc 
catgccccgc 
ggcgcgggtc 
cagccccacc 
ctcacattgc 
cacgccgctg 



gagacgccca 
ggatgaagtt 
aacaggtgcc 
ctgctctcca 
ccgttgtggg 
gccggtgagt 
gggtagtggc 
gtggtacact 
gatggtggca 
gatagggctg 
gggagatgag 
agggggctca 
ggacaggacg 
cctggcgtac 
gtgcaggtcc 
aggcccaagg 
cagcaagaaa 
ctgctgggag 
gtgttcgcag 
ttctcaatct 
tgcttcacca 
tgggcgtcct 
agccccgcta 
acatcctcag 
ccccacgccg 
tacacgtcct 



cctgtagcag 7560 
gggcgggaag 7620 
ggtgggggcg 7680 
gacccccggc 7740 
tgtaggtgat 7800 
cccacccgag 7 8 60 
acgctaggat 7920 
ggggtggtgg 7980 
cactggggag 804 0 
gatgacgggg 8100 
gcgggcactc 8160 
ggccggggat 8220 
tacctggatg 8280 
gagatagtga 8340 
tcaggggctg 84 00 
cctagggcct 84 60 
cgcccaagcc 8520 
gctgggtcgg 8580 
cgcgacatgc 8640 
gcaggcgccc 8700 
ccttgcgggt 87 60 
gcctacggaa 8820 
gcagcccagg 8880 
gacaggccaa 8 94 0 
agatgccccc 9000 
gcggcacatc 90 60 
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cgagggctcc aggttgacgg aggcggcgcc cacgctgtcg cggcccagag cagcataatg 9120 
ctgcaggccg ttgcaagagc cgtcctgagg aaggggcggc aaacgggaga tggaagctag 9180 
agaggcagag acgtgtggga ccccaaacca ccccccaggt cgagccgttc ctagggccgt 9240 
gcacccccca gccaagtgca ccggagcccc cgcacgctcc cgggagagac caggagccat 9300 
ggctcccgca cactctagga ccacctccag agaataccac gagcgaaggt gaaatctcac 9360 
accctcaagt cgagccccag gcccagtgca cactgcacgg cctcgggggc cagacccagc 9420 
tggctcacct gatggacggg gaggtgggag acataggcgg cagggtcgga ggcgcgcaca 9480 
gcgttcgcca cctccataca gcaggccagc gtctgccagg gttcctccgc gcccatccac 954 0 
cactttcggc cctgcgggga cagcggatgg ggggcagtga ggcccgggcc cgatccctga 9600 
gcccgctggg aggctgtgtt gcggggaggt gggaaatggg gaggagacgc acacccgtga 9660 
tagtgaacac gggacgcatg tgggcgagag acggggcggt ggctggatga gttctccata 9720 
gccacggatg gaggatggga gctgcgggtg gaccgggctg aaacaagcgt gtccggagct 9780 
gccgggggag gagggtggac agaggacctg ggggcgccgg gggaggaagc agctcggcgg 98 4 0 
atgcagggga ggggggaacg tggggaacgc gggggccctg gggcagggga gaagggagaa 9900 
gcaggacggg cagggggcgc gggggaggag agcgggcggg ggacgtgggg gcgccagggg 9960 
agggggaggg gaggaggaag acgggcaggg ggcgccaggg gagggggagg ggaggaggaa 10020 
gacgggcagg gggcgcgggg gcgccggggg agggcgcggg ggcgccgggg gagggcgcgg 10080 
gggtgccggg agggcgggga atgcgggggc cccgccccta ccgtcaaggg ttggtccgcg 1014 0 
gagtccagga tgtcatccat cacctcctcc gcaaaggcca ggcgcttccg cagcggctcc 10200 
cgcttcttca accccgtgag attgaccagg tggatcttga gccaatccag gccgtgcggg 10260 
ccgagcgggc ggccctgggc gaactccagc agggcccgcg ccacgtcgct gcccaggtgg 10320 
ttgaagtgcg gcgggcaggg gtaggtgcgg ccgcggaagt ccatgttgtg cggcagccag 10380 
aagacgcggt cccgcaggtg ctgcgccagc gagaggcggt acagcgcctc cgcccgcagg 104 4 0 
ctgtgcatct cccgggccac cttctggcag tgcgccagct cacggcgcag ctcggccttg 10500 
cgggcgggcg cggcgctgtg cggcaggtgg gcctcgggcg gctggggcgc ctcggagggc 10560 
ggggccggca cgcctagctg ggggcagccc ttggcctgga agagctgcag caccaggtcc 10620 
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agcacgcgcc cgttgacgcg ccaggcgcag ttgcccagtt gggtgagggc gtccagtgcg 10680 
ccatgcagcg cggtgggcgg gcaggtttcc agcagctcct ggtgctgcgt ggcgccttcc 1074 0 
accgtgcgca tcagcttggt ggggctgagc aggaaagcac cagagtgcgg cgatgtccag 10800 
ggcagcgggg ggcaaagcat gggtacatcc accgcctcga aggtcagcgt gggctccgcg 10860 
gccttctcca gcagctgcac gtaggccggg tgcggcttca ggatgccgat ctggggtgcg 10920 
acaggcagac gggtcagggc cccggtgctg gggctttcct gttcccaccc cttaaacttg 10980 
ggtgagaggg gccggctccc cggccaacaa gaaaccagtg tggcctccca cgaacagaag 11040 
ccacctccag aaacggccgg acacctgcat ggacacccat ggtgtgtccc gagtcctggg 11100 
aggtactgac ggctgcgctg agatcaaggc tccgcccaaa ggcgccaacc ccatggggtc 11160 
cctggtcctc ccagcgggat gccccccagc tcaggagggc actgcctggc acctgctgga 11220 
cgttgcggaa ggaatacacg tggtagagca cggggacaag ccgagaggaa cgatgcggct 11280 
tgtccaggct gcatggcatc tgcgtagcct gcaccagcat ctccgccagc agcttgccca 1134 0 
gctccatctg cactggcagg ggccagggct gctcccgcag ggcctcgggc gcccccagct 11400 
cctcccagta ctgccgcggc aggcagggct cgggcacctg taggacaggg cggtcagggc 114 60 
gctgggcacc ggggcccctg agctagatgc cccaccgccc gtgcctgacg cccggtgggg 11520 
catctgtcag cccaagcata cagatgaaca gactgaagct tgggtgcaaa cccggctgct 11580 
ccagggaggg agagcgccca cccaccactg gccccagcca ggaggagagg gggtgcgagc 11640 
ctcacctcgg cgtcggaggc cagcaagcag aggtacttcc tgtagtggtt ctgcagcgcc 11700 
tgcacctggc cactgacccg ctgcctctgc accacgtgcc ggctgaaagt gcgcgcactc 11760 
agctcccggg ccagggtggt gaaggactca ccttgggcgg gcagcgcctg caggacctgc 11820 
ggaaggcagc cgtgagtgcc tgcccgcccc gcccggggac ccggccgcgc ggaggaagac 11880 
gcacctgcag gagcatccgc accacctcgc gctcgtccag caggcacagg aaggggtaaa 11940 
gtgagaaccg gccctcgtac acctcgcgct ctaggcggtt cttggtctcc cgcagcgccc 12000 
ggcacagtgc tttctcccat tggtcccgca gggtcttcag ggtcttccgc tgcgggggat 12060 
gaacgggccc ggtgagcccc gtggcagctg gtgggaccca ggctcacagg acgggggtca 12120 
ccgcagctcc ctgcagagac ctcatggccc tcaaggtccc tgctgtgtgt tccgggtagc 12180 
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tcctcacccc ggcctgccct ctgccggctt cagcgtgcct gacgcagcca agagcaaaag 1224 0 
cccagctgca gtgtgcgcag aagcacaggc caagacccaa cctcgggacc ccacaagttt 12300 
tccctgagcg gcagccaggc tgagttccta ggccctgcat gaccagacca gggcatgagc 12 360 
aattcaaccg catacacgga gctcagcccc tgcggcggac acgcgacccc ggctcagccc 12420 
ctgcggcgga cacgggaccc cggctcagcc cgtgcggtgg acacgcgacc ccggctcagc 12480 
ccctgcggcg gacacgggac cccggctcag cccctaccgc gtgcttgacc tccttgcttg 1254 0 
gcaacgtggg cttctccacg gacaccacgc acaccctgct ggccagctcc atgtggagct 12600 
gcttctcaaa gaggcactgc agggtcttca agggcaggtg cagcttcggg taggacacac 12660 
gcccatcctg cagggatggg ggtagtgagg ttgggggctt gccagagggc gacctgccct 12720 
cccaggaccc cgagacagca tgggtgcacg cgtttctgcg tctcctgcaa gttgctggtg 12780 
gctatcgctg acgcggggaa aggcgggctg cgggtaaagt cagtgccagc agtgcaaacc 12840 
aaaggccttg accctcctgg cctcgacccc tctagaaggg acactgggca ccgtgcaggg 12 900 
ggtggcaggg gcggtgatgc tgggagctgg cagagcctgg ggagaccgtt cactgcaccc 12960 
ccagatgttg gctgttttct cctcaaactc agaactgtat gaatgtgacc catccagaaa 13020 
tagatgaatt aaaaataaca actaaagcct agcgctttga gaatcaaaga cgcacgtcca 13080 
cataaaagct tgtacacaaa cgttcacagc tgcatgactc gcagtcgata agtagaaaca 13140 
gcccaacgtc ccataaacgg acgaacagac gggcacggcg cggccatcca cgcaccggag 13200 
catgactcag ccctgaccca ggtcgcctcc cggaggcacc atgaggacgt cacgctcagt 13260 
gggagatgcc aaacacaaaa ggtctcgcag tgtgtggtcc catttctatg gaatgtccag 13320 
agcagactca tccacagatg gggaggggat ggggagtgac ggggatgggg acgaggcttc 13380 
cttttagggt gatggaacat tctagaatta gacaaccgtg actacactaa aatcgctgaa 13440 
ttacaccttt aagagggttt tatggcaggt gaattacacc tcagtaacag. acgagcccac 13500 
tgcgtgcacc tggcagcccc actcaaacgc actgctctcc tgtcacccca ccctctctct 13560 
gcggcccccg accacctcgt ccccctgagc ccacaccctc . agggccaaga ccctcccagc 13620 
tctgggtcct cccatcttct cagaggagga agggaggaat tcagggccca gcccaggtga 13680 
gccctgggca ccggggaggc ccattggtct gagctgaggc tccaggaacc cccaaagggc 13740 



WO 99/37660 PCT/US99/013I3 

-162- 

agctataagg actgaagtct gccggggccc acgtgggctc accttggcat acacgtccct 13800 
gagcagcttg gaggtgttga ccgggggcgg cagctgcggc gggaggctga aggtgggctt 138 60 
caccttgtgc acggccttca gaacagtggc ccgatcctcc tcagacagca gaacggcggt 13920 
gaagagtgcc tgcagcttca gcccctcctg gctcatctgt tccagacacc tgtggtgcag 13980 
gcggcctgct cgagggacgg gccagcccca cgctgggctt ccacagaccc caggggaacc 14 04 0 
tcgtgaccac ctcctgctag cctgcaggtc tcggtgtggc tgtcaggccc tctgggggtc 14100 
cccagccccc agcccaggca ccgtcccaga tcttaaaacc ctgggaggga catggtgggg 14160 
ggtgggggcc ctcccgacac cacctacctt tcgatggtcc cggcgtcctg gtcctgcctc 14 220 
cccatgcact ggagggcagc cgcataggac agcaggtccg gagtcaagcc ggcatccttc 14280 
accatgaata acacatatac cagctccttg aaggcaccct gggagaccaa gccagggtga 1434 0 
gggtctgggg ggatggccca acctccacat cctccctgct ccctggagac cccttctctg 14 400 
tagccaccag ctcagcaggg gacagggtca ccaggcagga gtggccagct gggcagaccg 14 4 60 
atgcatcccc ctgaggttct gacacacaag ctccacctgc agaggcagcc gcatggcccg 14520 
ccaggtggga ctgtgggagg ttcacgttcc tctgggaggc agcttgttaa acctccagat 14 580 
ttgtcaattg tgtggatctt ttcaaaggac tgacttggct tgactgttct ctgctgtttc 14 64 0 
tgccttccat ttcatcgatt tgttttaatc tttgtaactt cctctcatct acttgcttta 14700 
ggtttagtga cagcttcttc ttctagtttc ctaaggtgaa aggtgacgta tttggtctga 14 7 60 
gatgtttcac tttttttccc cccaagatgg agtcttgctc tgttgcccag gctggagtgc 14820 
agtggcacaa tctcagctgg gccgggttct ctgcctccca ggttccagca cttctcctgc 14 8 80 
ctcagcctcc tgagtagctg ggattacagg cacacgccac cacaccagct aattttttgt 14 940 
attcttagca gatacggggt ttcaccatgc tggccaggct ggtctcgaac tcctgacatc 15000 
gtgatccgcc agcctcagcc tcccaaagtg ctgggatgac aggtgtgcac caccgcgccc 15060 
ggccatcacc tttccgaata taggcatttt gtgactataa attaccctgc* gagcactgtg 15120 
tcagctgcat cccaggactt ctgacaggtg gtgttttcat tttcattatc tccaagtgtt 15180 
ttcgaacttc atagtttact tcttctttgg aaattttatt taattatttt tttagataga 1524 0 
gtctcgctct gtcgcccagg ctggagtgca gtggcgcaat ctcagctcac tgtcaacctc 15300 
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tccctgtaat cccagccact caggaggctg cggcagggag aattattgaa cccaggaggc 16920 
ggaggctgta gtgagccgag atcatggcac tgcaccctag cctgggcaac agacagagag 16980 
tctatctcaa aaaaaaaaaa aaaaaaaaga tagaagcaat gccttagcct ggctaacatg 17040 
ctgaaacccc acctctacta aaaataaaaa ttaaaacaat tatccggggg tggtggcaca 17100 
cgcctgtaat cccagctgct cgggaggctg agctcgcagt ccagcgacat ccaggactgc 17160 
tggccacccc ggaacgctgg gagaggcagg aggggcccct gctagagcct ctggagagac 17220 
ttcgggtctg cagacatctt gattccagac ttctgggctc gtgctaagag tgcgtttctg 17280 
ctgtgcaagc cgccaggttt gggacacttt cgtaggggcc gatcccaaaa gcgccctgtt 1734 0 
acagtgtggg -ctctctgccc agggaatcca gggggcttgt gaccttggag gggaaaatac 17400 
acgaccctca tcctcagtcc tcccggagtc tggcgccccc tgcagcaagg aggaaccagg 174 60 
cagcacgccg cctccacctc gcggtaagag cactgcggac ttcaccgcaa gactggcccc 17520 
acctgatcct gaatttcgct gtttgatgcg ttaataaaga "agcacatcaa gttctctacc 17580 - 
acgaattggt cttaatattg cgatatctgt attttaatat aatagtatcc catgtttacc 17640 
caaatattaa gagaagcttt tactgttgtt tctcaaatta gggctgaagg atcatggggg 17700 
gggagaaagc tgggaacgtt tgctgctttg aaagggtgtg taaacaacac cctccaaaac 17760 
aaccaagagt tccgaggaga aactttggcc ggatacggtg gctcacgcct gtaatctcag 17820 
ctcctcggga ggctcagggg ggcagatcac gaggtcagga gtttgagacc agcttggcca 17880 
acacggtgaa acccccgtct ctactcaaaa tacaaaaatt aatcgggggt ggtggcgggc 17940 
acctgtaact ccagctactt aggaggctga ggcaggataa tcacttgaac ctgggaggtg 18000 
gaggtggcca tgagccgaga tcgcaccacc gcactccaac ctagtaacag ggagagtatg 18060 
tcccagaaaa caaataaata aacaaacaaa aagaaaacgg caagggaaat tggaaaatac 18120 
tccagatgaa ccacaacgaa gatgggtggg atacatctaa agctgtgctc agagggaatg 18180 
cggcgccagt gaacacccac atttcacaca gaaggatctc agcacagcag cccgaccttc 1824 0 
cacctcagga aaccagaaaa aggagcaaag tcaaccccaa caccaaagcc tcatcctgac 18300 
gagggctctg caggctgccc cccgacgagg ccaaaagcac ccctgcccag acagattcac 18360 
gagccccgag aaagaacgga aggaaatgct caaggcatta gcagaatttc tccctacttt 184 20 
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tttggtcatt 


ttcaaaattt 


gagagtcaca 


cgtgatttgt 


atttgaaaag 


cctaaaagaa 


18480 


ttattaaaat 


aaaaaacaaa 


ggacttgaac 


ctgggggcta 


agagagaaaa 


gtccagtcta 


18540 


aatgagggca 


agttcctgtc 


tccaacgacc 


agggcaggtg 


gcccggctcc 


cggctgcact 


18600 


cacctgccgc 


gcccagccaa 


gcatcacggc 


gttgtacatg 


tccagcgtga 


gcagcttccg 


18660 


cttctgccgc 


tggccgtggt 


ggacgaccag 


caggtggtgg 


gcgaggggca 


gctggtcagt 


18720 


gagcaggcag 


cacttgaaga 


aggccaggag 


cctctgctgc 


tgacctgaga 


gctgggcctg 


18780 


cgagtgctgc 


cccgacgggg 


cctgctccac 


atcgaggctc 


agcttcccag 


gggcctcctg 


18840 


cagcagccgg 


gccagctgct 


cctcccaggg 


gctctcgggg 


gcctggcgcg 


tgcagtcctc 


18900 


caggcacccg 


gccatctgct 


tgctcaggag 


ccggggctcc 


acctgcaggc 


gcctggtcag 


18960 


cgccttgaac 


tccccgctct 


ggaatggcat 


ctgcagcttc 


gccttcaacc 


gctgcatacg 


19020 


catctgccgg 


gtccgcttat 


ccttctccag 


tatctttgcc 


cagcggccac 


agggcaccgg 


19080 


ggtggcatcc 


ttggccccca 


tctggacctt 


cctgggtggc 


tggaggctac 


catctccact 


19140 


gccacattct 


gggagccgcg 


ccacatccac 


cctgttcacc 


accacctccg 


acacgctct c 


19200 


agcctgcagc 


tgccgcaccc 


gcgcctggag 


cactgtgagg 


ggcagaaggc 


gaggacatga 


19260 


gagggacccc 


ctccccattc 


gagcacccgt 


ctctctggac 


cctgagccag 


gccaggaggt 


19320 


gcaggtggct 


gagctcgctg 


ggacccaagg 


cgtgaattcc 


tcatacttgc 


caacaacgtt 


19380 


gt aaggtctg 


cccgctgctt 


tccagacaca 


cgcaccccac 


cacctccgca 


cctccccacc 


19440 


cgagcctcac 


agaactcagc 


agccctaaca 


agctgccacc 


gaaacctgca 


gcaccacgtc 


19500 


tccccggtca 


ctggccgctc 


agaccctcca 


ggtgcacagg 


cccagaaccc 


ggggtctgtg 


19560 


acaactccct 


ccgtccacct 


ctcagtacct 


cctctgggct 


tgcctccaga 


atctatccag 


19620 


gtggcccccg 


cctcccctgc 


ccctctcact 


gtctagctca 


gggcctctgc 


acagactccc 


19680 


aggaccctga 


accgcccact 


ccctggctca 


accatggcct 


gcaagttcgc 


accccgcctc 


19740 


agcaagaccc 


ccccagctgg 


tggagctgcc 


acacacacac 


tcctaggctc 


ccagtgtcta 


19800 


caccggtgga 


cgctgagcca 


ctagctcgca 


gggaaaacgc 


ggctcctgct 


cgtgccgcct 


19860 


caggttgcat 


ttttgccaac 


caatcaatgc 


ctaagtgttc 


tgtatctctt 


taaagaagcc 


19920 


ttgttggaaa 


tctattgctg 


gccgggcatg 


gcggctcacg 


tcggtcatcc 


cagcactttg 


19980 



WO 99/37660 



PCT/US99/013I3 



ggaggccgag 
gtgaaacccc 
tagtaccagc 
tgcagtgact 
taaaaaaaaa 
actccggtgg 
gtgatcaggc 
aagggggggg 
taggaggccg 
gtgaaacccc 
tagttccagc 
tgcagtgagc 
caagaaaaaa 
caaaactcag 
ttttgagatg 
ccgcaacctc 
cgattttcct 
gctaattttt 
actcatgacc 
gccaccgcac 
gtgctgggag 
gcccacctcg 
gttttttccc 
acaggatgtg 
ccacaccacc 
ccaagtttcg 



gcaggaagat 
gtctctatta 
tacttgggag 
caagatagcg 
agactgttga 
ctgaggtgaa 
caccgtgctg 
gggacccagg 
aggcaggcgg 
gtccctacta 
tactcgggag 
caagatcgca" 
aaaattgctg 
gagatacttt 
gaatttccct 
caccttccgg 
gcctcagcct 
gtatttttag 
tcttgatccg 
caggcctcgg 
tacaggcgtg 
gccacccaaa 
tttataaagg 
ggacagaagc 
agccaggcca 
cagtcatcca 



cacctaaggt 
gaaatccaaa 
gctgaggcag 
ccattgaact 
aataagccgg 
agaatcacct 
cagcctgaga 
tgtccagatg 
atcacgaggt 
aaaatacgaa 
gttgaggcag 
ccattgcact 
aaataaaaag 
taggattaac 
cgttgcccag 
gttcaagcga 
cctgagtagc 
tagagacagt 
cccgcctcgg 
acccttgacc 
agccaccgca 
agtgctggga 
ttctcccgcc 
accggcggga 
ggacgagggc 
gtctccatta 
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caggagttcg 
aaattagctg 
gagaattgct 
ccagcctggg 
gtacagggcc 
aagcctagga 
gacagagcag 
tggtggctca 
caggagatca 
aaattaaccg 
gagaattgct 
ccagcctagc 
acaagcgtga 
agttgagagc 
gctagagtgc 
ttctcctgtc 
tggcactgca 
gtttcaccat 
cctcccaaag 
tcttgatccg 
ccaggcctcg 
ttacaggcgt 
tctcccttcc 
agcaagcaca 
ctgccaccgc 
ggcgcctacc 



agaccagcct 
ggcgtggtgg 
tgagcctggg 
caacagaaca 
gcgcacctgt 
gttcctggct 
gaccctgtct 
cgcctgtaat 
agaccatcct 
ggcgtggtgg 
tgaactcggg 
aacagattga 
tgtccgcctt 
tttgttttgt 
aatggcatga 
tcagtctccc 
ggcgttcacc 
gttggccagg 
tgctgggatt 
cccaccttgg 
aacccccgac 
gagccaccgc 
cggctgccta 
gggaagctcc 
tggagcctgg 
ccccagagcc 



ggccaacatg 
catgtgtcta 
aggcagaggt 
ataatccatc 
ggtcccagct 
gctgtgagcc 
caaaaaaaaa 
cccagcactt 
ggctaacacg 
tgcgcgcctg 
aggcggaggc 
gaatccgtct 
cagagtgctc 
tttgttttgt 
tct cggctca 
cgggttcaag 
accatgccca 
ctggtcttga 
acaggcgtga 
ccacccaaaa 
ctcttgatcc 
acctggccag 
atggacgcag 
cacctccctc 
gctgtccctc 
aagccaggac 



20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640. 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 

21240 

21300 

21360 

21420 

21480 

21540 
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agctgagtca gttcagggtt cacatcctgg ctctgcacat gtggccttgg cggcggggcc 21600 
gggggggggg tctctccaga cataatcttg ggcctcacct atgtccctgg aaagtgggag 21660 
cacctggtgg ggttctgggg agggggaatt acgagagctc caggaaggag cctgctcagc 21720 
aaggacaggg cccatgagcg gtgcaagaga tgtttcagca acgccgtctg ggcgtgtcct 21780 
gggacccgag aggtggagac cgccctcagc ctgtctcaga atctgagcct ttgccttttc 2184 0 
tcccggcagc agggagcgga ctctcctctc ccgggccgcc gtgggggtcg cgctcaccct 21900 
ccagcagctc cacgtggccc cagtccttcc tgcggtcttg gtcttgctcc tgggggctgg 21960 
cggacgagct cctcctgggg ccgcagacgc caccggcggt ccctgcggga aagacgagag 22020 
cggctgagcg gggccgggcg tgtgggcggg ggcctccata aaggcagaag ccgaagggtc 22080 
gaagggcaaa ggagccctaa acgcagcgga aactctcgga gcacgggctt aagttggaaa 22140 
gaaactaaga cagcgaaggt ggaagggccc cgccgcggcg aacacgggcg cggaaccgcc 22200 
gagagagggt tcctcgcact cgaggtgcag caggtcaaag gttaagagcc ctaaacacca 22260 
cacctggggt caggaggctg cataagaaac cacgagtcaa aggtcagact gcacggagga 22320 
gcctcagtcg aaaagcgggc aagggcgagt ggaaagcggg gccgggtcgg tgggctgcgc 22380 
acgcccaggt gcaaagaggc aaaggtcaaa gcgccaaagg ccccggccgc gcggggagga 224 4 0 
gcccacgccg tggcccccgg gctgcctggc cgtctccctt tgtgttacct tctttgccgg 22500 
ggagtcccgg gcggccgcaa ggccgtaggg ctcgtttgag ccccgccgct ccgcggcccc 22560 
agcaaagtgc cgacattacg cacgccgctc caggccaccc caccggcccg cgcctgcgca 22620 
tgcgcccgcg ccgcctgccg ggagttgtgg tttcatggtc gacggaggct gcgaagggaa 22680 
accccagccg gaagtagact cccaggatgc agcggaggcg cgaaggcatg cgccggtgga 2274 0 
cgctctgatt ggttcctcct gctgttttta *aagggagggg gcgggacaga gctgttgccg 22800 
tggcaactgg gaggcactct caggctgttt tcccgaggac ctcaaatccg gacttttttt 228 60 
ctgtttttct ttcttttttg gttttgtttt ggacgcgttg tggcccaggc tggagtgcag 22920 
tggcgtgatc atagctcagt gcagcttcga actgctgggg taaagagatc ctcgcccctc 22980 
ggcttcccaa agcgctggga ttgcagacgc cgccaccgtg cccggctttt tttttttttt 2304 0 
tttcaaggca tactcatcta ataacgagga cagcatctgc aatttagaga ttcctgtccg 23100 
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caaccttcat tgctccaacg acaacttttg 
ggaggaagct gaggctcaga gagggccacc 
tcagggaggc tgcaaggaga gaaagcccca 
tttaccctcc aatccccaag ccctcagggg 
tcacctgcta agtggggtga ggtcaaggga 
cctggtctcc gcgtgatcct gcctaggagg 
gaaggtgtgc ccacctgtgt cgttcagccg 
tgtggggcag taggcttggg aagggtgaga 
ggctaatatc ccagttgcaa aggggagctg 
ctttgggagg ctgaggcggg cggatcacct 
tacgcaagca tgcctggcaa catggcaaaa 
ccgggggtgg tggcgggcac ctgtaatccc 
gcttgaaccc gggaggcgga ggttgcagtg 
gggtgacaga gcgagaacct gtctcaaaaa 
gcctcaggcc tgtaatccca gcactttggg 
ggagttccag acaagccttg ggcaaccgag 
aaaagaaatt agccaggcat ggtggtgcac 
ggcaggagga ctgcttgagc ccaggaggtt 
gtactccagc ctgggttgac agaacaggac 
cctgaggcag gaacaagcgt ggacagagga 
ggagcaaaga tgtagctggg gctcagttag 
tcagctcaag ctatcctcca tccccaaacc 
acgaggactc gacctcactc tggaagggcc 
ctgctgccga cgcctgcgtg agcgcactga 
ggcagcaccc tccctcacct gcctcagtcc 
cagacctcac ccactactgg ggtcacctgc 
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ggtaagagtc attaggatgc cgtctatcat 23160 
aagttgctgg aagacacagc acgtgcgacc 23220 
gtccgcgaga ctcccagcct ccagcttcag 23280 
caggagccga atggagcggc aggcttggat 23340 
atgaaataaa cctcggagcc tagagcctgc 23400 
agcagggcgg gagctttaga atggaacctg 234 60 
gggcagcagg ccagaggcgg gagcgcctgc 23520 
ataggaatat ctgggggtaa ctgtgttcca 23580 
gtttggtggc tcaggcctgt catcccagca 23640 
aaggtcagag ttcgagacca gcttggcaaa 23700 
ccccgtctct agtaaaaata caaaaattat 23760 
agctactcgg gaggctgagg caggagaatc 23820 
agccaagatc tcgccactgc actccagcct 23880 
aaaaaaagtg caaagggagg tcagttcagt 23940 
aggctgcggc gggaggatcg cttgagccca 24000 
atactgagac ccagtctcca ccaaaggaaa 24060 
acctgtggtc ccagatactc gggaggctga 24120 
tagactgcag tgagctgaga tggcgccact 24180 
cctgtctcaa aacaaaacaa gtgcaaaggc 24240 
gcaatttgag cagagtgggg ctggggagag 24 300 
ggggcctgac cacacggggg ctcgggggcc 24 3 60 
ctggcacttc agtttcccca tcagcccaga 24 4 20 
tggcagcctc cttacagcac attccagacg 24 480 
tgccaccggc tgggaatgtt ttcgacagac 24 54 0 
acctcagggt gccccagcgg gctgtgacct ^24 600 
ctggccctga atcagccagg cctggtgtgc 24 660 
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caagacctac 
aaccgacttc 
gatcccagca 
agcctggcca 
gtggcctgtg 
tgagaggtgg 
tagccaaacc 
cctgtggccc 
aggcagaggc 
agaccctgtc 
gaaacagctc 
cccaccattt 
tggctggtca 
atcccctgag 
taaaaattca 
ggaggctgag 
ctcgccgctg 
aaatacaaca 
aaaagtggga 
acgggaggcc 
atggtgaaa-c 
gtaatcccag 
ttgcagggag 
ctaaaaacaa 
atggaagggg 
cttaaagcaa 



agacaccccc 
tgaactattt 
ctttgggagg 
acatggtgaa 
cctgtaatcc 
aggttacagt 
ccatcactac 
catctacttg 
tgcagtgagc 
tcaaaaaaaa 
attgcccaaa 
attttaacta 
tggtggctca 
gtcaggagtt 
aaaattagcc 
gtaagagaaa 
cactccagcc 
ttaattattt 
aaccatatat 
gaggcgggcg 
cctcatctcc 
ctattcagga 
ccgagatcac 
aacaaaacaa 
atggcccagg 
cgttcatttt 



tgcacccctg 
caccattcct 
ccgaggcggg 
accccgtctc 
cagctactca 
gagccaagat 
aaataataca 
ggaggctgag 
tgtgattgag 
aagaacataa 
ttccagcctc 
catctgtctc 
tgcctgtaat 
cgagactggc 
ggacgtggt a 
tgcttgaatc 
tgaacgacag 
ttcttgctta 
ctggccgggc 
gatcccctga 
aataaaaata 
ggctgaggca 
accactgccc 
aacccaacca 
tccctggagc 
cttacatttc 
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caggctggca 
tatgcgttag 
cggatcacgt 
tactaaaaat 
ggaggccaag 
cgcaccactg 
aaaaaatttt 
gtgggaagat 
ccactgcact 
tctgggtttt 
gcaactctgt 
caccactcct 
tccagcactt 
ctggccaacg 
gtgggtggtg 
caagagactg 
agcgagactc 
agttttacga 
gtggtggctc 
ggccgggagc 
acaaaaatta 
gaagaatcac 
tccggcctgg 
agcaaacccc 
ccctgtgaca 
tggaaatgaa 



gagccagaaa 
tcttttcttt 
gaggtcagga 
acgaaaatta 
ggaggaaaat 
cactccagcc 
gttggctgtg 
gtagaattgc 
ccagcctggg 
ggaataagac 
agccgccacc 
gtattaagta 
tgggaggctg 
tggtgaaacc 
cctgtaatcc 
aggttgcagt 
cgtctcaaaa 
agagacttaa 
ccgcctgtca 
tggagaccag 
gccgggcatg 
ttgaacccgg 
gcgacagagc 
acagagtcga 
aattaccaca 
aagtccaaaa 



cttgggtgga 
tatttgatga 
gtttgagacc 
gccgggcatg 
cacttgaacc 
ttgggcaatg 
atggtgcctg 
ttgagccagg 
cgacagagcg 
agcagtttct 
accccccagc 
aatgcaatat 
aggcaggcag 
ctgtctccac 
cagctacttg 
gagctgagat 
ataaattaat 
tatcaccatc 
tcccagcact 
cctggctaac 
gtgggtgcct 
gaggcggagg 
gagactctgt 
gaatcgctag 
aactcggtgc 
tcaggactgc 



24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
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ggggctgaag tcaaggtgtg tggaggcctc gctccctcca gaggccctgg ggctccttcc 26280 
tgcctctccc agcttttgaa ggctccaggt gtgcttggcc tgcggccaca tcactcccgt 26340 
ctcggtctct gtggtcgcac tgcagcctcc tcgtctgcct gtgtgaaatc tcctcctgtc 264 00 
tccgtattgt gaccgcgttt aggatgcccc aggacaatct tctccatatc gttcagatct 264 60 
tcatggtgtc aatatattga gactcttttt ccaaataagg caaatgtcac attctaggga 26520 
tcagggtggg gacttacctt tgggccaacc acagaggcta caaagaggaa gacaccactc 26580 
aatacaaagc gtgcgccagc ccagccctga tcggtgtttg ttgttgttgt ttttgtttga 26640 
gacagagtct cgctctgtcg cccaggctgg agggcagtgg catgatctca gctcattgca 267 00 
acctccgcct cctgggttgt atagattctc ctgcctcagc ctcctgagta gctgggatta 267 60 
caggcgtgaa aaggagcaag gctctgcccc agccacagcg cggatgcacc ttgaggatgt 26820 
catgctcagt gaaagacgcc agacacagaa ggacacacag tgtgtgatcc cctttatatg 26880 
aaatgtccac aacaggccca tccacagagg caggaagggg atgtgtgggt gccgggggct 2 694 0 
ggcagagggg atgagtgaca gctgatgggg cttcttcttg cggtgatgga atcttctgga 27 000 
actagacagt cgtggtggtt gcacaactct acgaggtact aaaatcactg aactggctgg 27060 
gtgcagtggc tcatgcctgt aatcccagca ctttgggagg cagaagcagg tagatcacga 27120 
ggtcaggagt ttgagaccag cctggccaac atggtaaaac tctgtctcta ctaaaaatac 27180 
aaaaattagc tgggtgtggt ggcaggtgcc tgtaatccca gctactcagg aggctgaggc 2724 0 
aggagaatcg cttgaaccag ggaggcagag tttgcagtga gccgagatcg caccactgca 27300 
ctccagtctg ggtgacagag ccagactccg tctcaaagaa ataataataa aataaaatca 27360 
ctgaactgta cagtgtaagt gggtgaattg tgtggtatat gagtgatgtt tccgaggtgt 27420 
cattaaagaa actcagacgc ctggggtggg gccagtctca ccgctgtggg tcccatcccc 274 80 
atcatttctc acaaggccct cagatcaccc ttccgcggtg gggggcggac actctaagaa 27540 
gggaagacct gggctcctgc tggcgagaag gcggtggaca tttcttcagt gtctggtgcc 27600 
gcgccctctg cccagcgtgc tccgtggagg gtctcattgt cttcctccag acgtctcttt 27 660 
actggcccat tttacagagg cggaaccgaa gcttggggtg ttggccacag ggctctagtg 27720 
tgggaagcca ggccaggctg gacctcagcc atggggaccc ctgtccctga gactgtggca 27780 
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cctgccacac cctctgtgtg acccgcctaa gccaggaaga gagggtcagg agatgcctga 27840 
gccaccaaga aggcatccca gcgtccagcc agaccggtta tccctccaga gggctccccg 27 900 
gcaggacagg ctggtcgcca tgtcttcagc ctggtgctat ttaaaggtgg gtgccacctg 27960 
gggctgtggc cgcagggcca ggactgggct gctgggagct gtgtccccac agcggaggtc 28020 
gccgcccctc tcaggcctcg gtttccccag ttgtcaatgc ctccacttgg ctgtgagtct 28080 
gtgagggtca ctgtgctcac cttttggggc ccagcgcatg gggcaggcag aggaagggtg 28140 
ggggccagcc gccttgctgg gtggttcccc gtggggcctg gggtatggct ctaagggagg 28200 
agcaagtgtg ggtgcgaatg gggccgcccc attcctgccg cctccgacgt gccccgccag 28260 
ccggccaccg acaggtctac gtggctatcc tccctcctgc ccacctacct gcccaaacac 28320 
acgtccccag tcgtcacctg cccacccacc cgcgcattcc cacacccttg tgggcctggc 28380 
tttcgggaaa ctacaatttg cggggagaga agtcccacga gggcatgccc cggagcctgg 28440 
ctggtcccac ggctgacgca cgcggcagga cctcccgtgt ccatctctgt ccccaagcat 28S00 
ctccgcctct gcccctctct gtctctgtgt ctctctcgtc tctcccggtc atcttccttg 28560 
tgtctcttga ctgccgccgt ctttctgtct ctgtctccct ccgggtctct gtctccctcc 28620 
aggtctctgc ggcccgcgtc tcacactccc gcccccgcaa cccgaggtcc tagcccgccc 28680 
ggggactcgg ctgactcacg gacacgcccc gcgagacaaa caacaaacgc gcggaggccg 28740 
agcgcggagt cccgcacggc cgcgcccctg tgcacctggc ccccgccccc gagacgtccc 28800 
attggccggc gccctagcct ggtcccgccc aagtggaccc cgcccccgcc ccgaggcacc 288 60 
ccattggccg gcgtccccgc cccagcgaac ccggccccgc ccccgaggcg ccccattggc 28 920 
cccgccgcgc gaaggcagag ccgcggacgc ccgggagcga cgagcgcgca gcgaaccggg 28 980 
tgccgggtca tgcgccgccg cctgtggctg ggcctggcct ggctgctgct ggcgcgggcg 29040 
ccggacgccg cgggaacccc gagcgcgtcg cggggaccgc gcagctaccc gcacctggag 29100 
ggcgacgtgc gctggcggcg cctcttctcc tccactcact tcttcctgcg cgtggatccc 29160 
ggcggccgcg tgcagggcac ccgctggcgc cacggccagg acagtgagtg cggggcggcg 29220 
ggggcctggg gtggggaggc ggcgggtgac ggcaacgcgg ccgccgtctt cacggtgacc 29280 
tgcgcccgcg ggggagtccc ggaggctcct ctgtgcagcc tcggcctcag tttccgtggt 2934 0 
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„ nflf . tflcac tgttaaagcg aaggctgcag 29400 
ctgtgagatg ggtgcagcct gcctggtggg agggttgcac g 

t-rrntotaot acaaccctgt gggtggggcc 294 60 
cggcggaccc ggctcagggg cagagaagcg tccgtgtggt 

anaaaaactQ gggcaggccc agctgcatgg 29520 
acccatctgc aggtgggaaa ctgaggctcc agaggggctg ggg 

«- ^nacatca cagaatccag tcagggctgc 29580 
cggaagcggc ggggggctga cctccggact cctgacatca 

ct gag tC ggg gccccc tCt g _ gacacccca, ctggcagg.g aggacaagga 29,40 
ggeacacaga aggga.ggg* -gcccagg gtcacactga cagggg.ggc ggagc.ggg, 2,00 
CCCC acaggg cccaggacg, cacggagcgg gcg t c t c t g t ccccaggg.c tgccgagcac 29^0 
act gagg t ag gccc.cagtg -gtggaat gtcaggagca agaggagagg c.gggcacag 29S20 
cagggga t g t ^ aggccagggg agtcggtgtc cccgccgggc ggggggcac, 293B0 
gggaaggggg cccgggcccg c.ggctgccg cc.gaa.cac cacca.cagg gcagg.aa.c 29940 

M-ttcatctg ggcgccaagg ccctcattag gccgcacgtg 30000 
accccctgtc cttcccaccg ctttcatctg gg g 

acgaggg = gg — " t9 9C "" C9 " 30060 

gtgg « t e g9 — - — ' -" 9 "" 9tC " t9C " 30120 

ctctral9 o 9Ct9S9 " Ct 30180 

mt t c,t 9ca «c g « gg — ~™~** — ' 3024 ° 

„ tggggg >= c=, 9t „t g , e a , aga =,== -ec«« g -~ ' 3 " 00 

^ ag c„« ^c.eW.' — "* 9 "" 9t 9aSC99tt " 30360 
^ t„=«-o* — 9a " C9 " Ca9,CC 30420 

a=gag9t = a5 9 .*™ " a """ 9t= """"" 30480 

« '" 9t "" 9CC " t C9 " a "" 9 30540 

, gg „ gg „, .t g »c 9tg ,. g =c ggg .^ " 30600 

t ,^ccac c= tggg c g .c a 9 «— «—« " 30660 

„ gtg a atg a c g t„ac,. g „«,=.«*, 30,20 
g a ggc .=c~ g - gggg . 9 ggg c,. ggaa ««««« 30 ' 80 

„., g ,= g cc „ — a. g gtg ~« gg . « gg — ' 308,0 

gg9 , gg=tg c c.c g , gg a 9 c ««.««, »™« "™ ' 30900 
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gt acaccca g ct g acaaa g c gg ca g acacc ca.gccgggg t gag c g a g t g tgggtgagga 30960 
gtggcggctg gccccagggt cctt g ct gg a caa g acactt ca g ctca ggg tggggcaggg 31020 
ctcaccca gg g cta=ccaca g ac g at gg c g tccaaatctg gctctgccac tccca gg cct 31080 
c „ct„ccc ctctgcaacg tgggctgctg a g c ggg ctt g g ta gg acagc tggcatacag 31X40 
tc gg cgctca a g catgtct g t gg t g tccca taaaccacc. g t g tcccact cta gg ccact 31200 
gccagcccgg cctccagtcc agag.cccag tccggagtcc cagtgactgt g c g t gg gccg 3X260 
gg ca g ct g ag ctgtgagggc cgggctgggg gctccatatg gggtggtgtg agctgt g a gg 31320 
gcc ggg c tg g gggctccata t ggg g t g g t g t g a g ct g t g a gggccgggct gggggctcca 31380 
tatggg gtgg tgt g a g ct g t g ag gg cc ggg ct ggggg ctc catat gggg t ggt gt g a g ct 31440 
gtgaggg ccg gg ctg gggg g tccct gggg t gg t g t g a g ct g t g a ggg cc g ggct gggggg 31500 
tctctggggt ggtgtgagct gtgagggccg ggctgggggc tccatatggg gtggtgtgag 31560 
ctgtgagggc cgggctgggg gctccatatg gggtggtgtg agctgtgagg gcc gg gct gg 31620 
gggg tccctg gg g t gg tgtg agctgtgagg gccgggctgg gggctccct. gg g t g gt g t g 31680 
ag ctct g a g g gccgggctgg ggggtctct. ggg t gg t g t g agctgtgagg gccgggctgg 31,40 
ggg ctccata t gggg t gg t g t g a g ctct g a ggg cc ggg ct ggggg ctccc tg ggg t g ct g 31800 
ct gg tc g ct g .ctcattgac agttatca.t gg tct ggg tg gg ccct g ccc cttctgactc 31860 
ccacatccca ggaacccttt cccaaccttc ctc g t gg t g t tgct g ccccc ctgacgtccg 31920 
tccctct ggg t g t g tggga g ccccccc.cc atacacacac aca g at g ct g ctctt ggg ct 31980 
gagct g=a g g gaca g c g ct g .cctggccct cccac gggg t cctcatcgat ctct.cactc 32040 
ccccagctcg tgggggccgt cctgcttccc gttccctctg cctgctcctt gctcctccct 32100 
cacat g ct gg gggg^ctcc tggtgtcagt cacggctctg ggggatcctg agtgtccgtc 32160 
gtggtcggga ggggactcgt ggtcccgggg gtctcctggt atctgtcgtg gtcctgaggg 32220 
ccctgcacga agcacagcgg acagcagcgg tgct gggg gt g a g cca g caa gg ccctcccc 32280 
gacccccgcc tcccccaggc atcctggaga tccgctctgt acacgtgggc gtcgtggtca 32340 
tcaaagcagt gtcctcaggc ttctacgtgg ccatgaaccg ccggggccgc ctctacgggt 32400 
cggtgagtgc cgggcagggc tgggcggcgc gggcagggtg gggagggt.. gccg.cctca 32460 
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ccccc g ccc g ca.c.actct acacc g t g ga ctgcaggttc c ggg a g c g ca tcgaagagaa 32520 
cggccacaac acctacgcct cacagcgctg .cgccgccgc ggccagccca tgttcctggc 32580 
g c t ggacagg aggggggg.c cccggccagg cggcc.gacg cggcggtacc acctgtccgc 32640 
ccacttcctg cccgtcctgg tctcctgagg ccctgagagg ccggcggctc cccaaggtgc 32700 
ctgggct.gt ggcgaggg.c ccggccacgc ttgttcttcc ccctgcgggc tctgtaagcg 32760 
otgagtgccc accgtgtgcg ggcgct g tg g acacagccca ggagccctcc aggggggtcc 32820 
cagcctgagg gggtggtggc caccaagcag gttcaatcct gagttgggga cctcgaggac 32880 
ccaacagggc gcctctcggg ctgaaggacg cagacgtcga aaggtcgagg gggacgtccc 32940 
aggcagggcc cggcagaggc aggggctcgg ggtggggagc acgttgggag tgggggcagg 33000 
agcggagggg a gg g g a ggg9 g =c ggg gaga cggtgacaga cgccgcagaa caccagcctc 33060 
gaagccggtc ccgtcccggg aatctgcaaa tacaacgcct tgcgaggaca aaggcacctg 33120 
caggtgggac ggagatggag gagcatccag ggtgggg^t ccagggcccc agtgtcctca 33180 
cagggtcctc acgacaggag gcgggacagt gagagccaga ga g agat ggg g atgggccgc 33240 
gctgtggccg tgaaggggag gaagggccct aagctgaggg acgtgggtgc ctccagatgc 33300 

arnntt ccacac tgga gcccccggga gggaccggcc tgctcctcjcc 33360 
tggggaaggc gggaacggtt ccgcactgya y 

tt gatatgag cccagtggga ccca.tttgg actctggcct ccagaaccgc cagaaaataa 33420 
acgtagtaa. ccatcaactt tgtggtcttt t.ttaca.ca gacgtcggaa atatgcacac 33480 
ggtgt ctgaa actgttctca tgacaaaata agcctcagat cccccgggga agggcggagg 3354 0 
ccaacgcctc ggtgttcctc cgatcccccg ggaa gg gcgg aggccgacgc ctcggtgttc 33600 
ct cggatccc ccgggaaggg cagaggccga cgcctcggtg ctcctcagat cccccgggaa 33660 
gg g Ca gaggc tgagggcagg agccgtgctg ggtg ca g ggc aggcc.gggg gcttcatgcc 33720 
gCtg tcctgc gggacgcaga gagggctggc c g tc gg t g t g ggggcgcccc cacctgtgcc 33780 

aactccqctg ggacttctgc ctacagccct gggagtcaaa 33840 
cagcgccctc ctgacatcct gactccgcty yy 

ctccagcctc tcagagaaaa ggtca.a.cc aagagcccca cagcctggag ccaggcagtg 33900 
acaccctgg. cctgtctccc cttctgtgtg tggggcgaca gcagcatcgc cctgg.gaag 33960 
tccccgggga cggccag.gc tccatcccca gccgccgcct tccacataaa t.c.gg..g. 34020 
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ctggg cc gag g cac ttg c tg gg a ggtg c tg a.ca.cct.a cac gg aaaac cct t ct ggg a 3,080 
a gggaggg tc gtg ccc gg cc c gag a gCtt c t.ctcaccct g ca g aca g aa ^cccca 34140 

arrflcaa cctct gggg a catctttggc tggcatggag t ggg tg g agg 34200 
ccccagggga caccaggcgg cctctgyyy 

ac , g , 9Ct5 c a=cca gg . tg t cccca, g « *a~cw 3,260 

g c=a 9 tc gg a TO *c,cc .«««««, "="" 9=t 3 " 2<> 

g ,a.<a.a.c » g e tgg c»c ««, MOCC. 34360 

„c. gg ec gg ,c g = ggggg c t „^ t ,. 3,440 

tgg ,t=.c g . gg ,-„«- " 99t9 "" " C9t9tC " 34500 

«,a..,tac aaaaa.tta, =c g9 ,= gtgg t g9 = gg9 c g = «^««=c 

g . gg ct g a gg c gggg 3,t= 9 «t g aatc= g gg a gg e gg a, g tt g =a gtg8 g c tg a g ,t= g 34620 
c g ee.ct g c. ct=ca g =ct g gg c g ac. g a g ca.aetcc, tetc.aaaa, aaaaaaa... 3«80 

fttctttatt tctaaaaeag a g c g t gg aat sggcgtscg 34740 

aagaaacagg tcagcagttg tttctttgtt 

wc „ c=c, ggg = ag t g .,.,ccc g g «=a=a=a g . 9 -tc.,c , g . t t,« cg 34S00 
c,a g ==ca.a c« gggg acc ccc g tt gt c= tca gg =. g t g a^,^ ecc==aa=a g 34660 
a g , gg . g c gg c« gggg ,ca =. g aac=a g c ,aaat= g cca 3.«. 

taa g .c..cc cc.,.= tg tt g c.aaet g t g cttcc g c«, c g ,. g c«c c tg a g = gg o 34,60 
ggg c, g .t gg gg . g . ggg =, 9ct g « gg c, c g . ggg ,ccc gg9g .c g =a 5 350,0 
ct tacca ggg =«t g t=ct g t = gtg =a g „ g =- ggg a.,a =acca gggg = 35X00 

gg ccac«=t t g acctc g . g =>.t g =, g ~ t=.c. g ccce =ac=. ggg < g 35160 

cc ggtgt cct Ct „ 9 coc„ c g cccee g a g gggtgggg c 3 aaccaatc, 35220 

t=ct grt c=t ctgg ccactc caeca,,, aa g tccca g = gg = g c,cacc 35260 

=cgg ca g =at «ct g ae.a. gg ccctcc, g tt=c g .,t« 35340 
cc-cw electee t =tc c a g c t = =aa««.a= t=cc.a g .,e cacc,c gg a= 35400 
,c,c, g aac= c g a g e=t tg t ctccctcaac tcaa.actcc «„«ce«o 35460 

,„„„=„ = t = gg cc ggg «.ct, t ,ac == gg a g c. gg C,ccc„« tgt c g c g =a g 35520 
,etc gggg ce taa.a^tt g tt«ctca g tc= gg . 9 at= „ gg ae g a« = g , gg ta.cc 35560 
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tccctacctc ggtg tcctcc at.caacctc g tct t a ggg c acc ggg tac g ttaoctc.t. 35640 
aggag cc g a g t cc g c ggg tc c tgggg tt g a g at g t gg ac g ccctca ggg c tgg cac t ct g 35700 

fMtfl aaaatcccaa cgcttctctc ggctccgcaa ccccagaggg 35760 
ccctggcggc cacagtcatg g aa g tccc«a y 

nanaaccCQC cac g C acgac cccagagggc ggccaccagg agggcccgcc 35820 
cggccacgag gagggcccgc cd«-yv,c ^ 

,e g c 9 c 9 a.=c c=. g . g99 c 9 soccer, ^ 35880 
. g .. 99 t g =c c tg c 9 c,c 9g t =c 9g .c. g9 t^atcc „«ac= tgg c ^ 35,40 
g ,c g ca 9 .c. = g tc 9 = gg9 a c«. gt ,.a, 

tt g9 . 9 ».c. g ,» gg „c g tgggg c,= g c 9 c=«ccac g C„ SW o gt c„=c gCg 360,0 
g a 9gg .=ac 9 99 t 9g t==t 9 ct^wotc etctcc g tt, g c tg tc t cca «e gt « g a« 36 12 0 
c tCgg .tecc . g9 .^t g9 9 a t c.tca g c aa=t 9 a g a tg — cotcccoc 36!80 
t „ 9OT =. g9 «c = c.c 9 =c« g =ca g< = t g t 9 =cc 9 a g c t9g e tg c,= ccc g99 cca g 36 2 40 
, caKca ,„ ,cc«,a 9 =. 9 t 99g9 tcc 9 ,<=t t «c,. . 99g9 cc» g , aacec g c 9 t 9 36300 
g =t„ g9t9 t S aec,..« t g9g9 c, g . 9 9 = 9 ct gg9 <== ct gg c 9 cttt „c 9 ct 9g „ 36360 
t«ct M t« t.a.tttcc 9 aec=. g t,. ct g cta=« C g cc. 9 =. 9 « 36,20 

ctc= tgg9 =t taa«tc gg9 a,. g =a g ttt t,«„co„ ccct 9gg t,c < = .. g «ct g c 36480 
,„„„=. g .oo=ct 9g . g .=. g9 a=c g ,a« t ,„, 9 cctc==a g c 36540 

tW ««« c=t 9 ,c 9 =ac ggg c g ea g aa g9 ..a 9 cc,c a 9 cacc g9 ct 36600 

a gt , gtgt »t """"" """"" " aa9tCt9 " 366 " 

gg , g c. g «t tg «=tt g ,a «t tg ct 9g t catcctcat, g t«c g a 9 ce 367 2 0 

ggg tc 9t9g . ««■«■«• ™— <" " 9=CC3tCC tSCTC *" 36780 

tg ca 9 =a ot9 w*** «9~~=«Jt g =~c, g e 9 c t g9gg9 . 9 cc g ct g99 ce=c 36840 

gtcttc c 9 c c a=.aa=ca tg cat 99 c= 9 c= ac gtg . g «e ...c, tC c g t 36 5 00 

gcag t,.t.a tttaa.atta tum.c tcc.cc g ct g ..c g «t'a g . g9gt ,» gg « 36960 

, 9 ,c. 9 a g9 . c 9ggg . gg « «««*=« ca 9 a g9g9 ao =. tgt9g ccc . C9 ~ t tccc 3,0 2 0 

„ g ==a g99g g =« 9 t ggg c = 99g c== 9g9 t «t 9 ccc„ g ..o. 9g c 9g 9 a=c t9 =. 9 c 3,060 

gc „ac=a 9 c ca. g c gtg9 c g oc g =c 9g9g =«««*=« ,t g99 t g =c, tg t 9g c g =t, 31140 
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g « gagggtg w« «™ — * 9 - 37200 

g c« gtgg c g ««««,= tao,c g > g ct S ,a 9 ,a g „= o»^c g ,c g »«» 
g «„e=eca ' 95Ctttt " 9 3 " 20 

g ca 9 .c, ggg o^t* ,c« W ™, «~- "3.0 

«, gagg c gg 9 , g , g « g c 9 g te gg9gtgg 37440 
cc, g .c. gg9 «=^~, *^^« g =, ggg c„c tg «oc,c.c 3,500 

„ nmractc qagcccggag cctgcgggag 37560 
gctcacccag cgcgtagtcg tccttgcaca cagggcactc g g 

tgt gcagc t g cgg.cacagc gggcgtgggg ^ ,cctt«.„ gcaggctact 3,620 
CC acagcc t c agccggaggc cgcccc.gag cccagcgagg ggagaaaagc cgtg.g^ 37680 

„„ rtrtc ctcccagccc accttcaggg 37740 
tcccccgggc tgccagaggg gacctggaca gaaccctctc etc 

, nnrtcac acctgtcatc ccagcacttt gggaggcega 37800 
aaatgetega ggccgggtgc ggtggctcac gectg 

gg c, gg , gg . ™ «"»"« 37860 

tgt ctcta« — 9 — C9C " 9 """"" 37 " 0 

ta = teggg . g .ct^tc. «™ ^ " tg,9ga " 31980 

g cc tg90 t« tot«««e tg =co,t ggg ggg =ct g t 9 t «t tC „ac„ »t g , ggag = 3*0,0 

tg „a g ac„ g . gg . g .« 9 - ™" < ~"~ 0 " 99 ""'" a 38100 

cgcg a gg ct g .a*,,— « ~«««~~ 38160 

38186 

ttggagcccg cccggggtcg gagatc 

<210> 39 
<211> 720 
<212> DNA 
<213> Unknown 

<f23> Description of Unknown or g .ni S m=Onkno-n 

" c "" t9a * C9Cacw * 60 

tcgcat = gg o ,~ gg - g c ' " t99 " t5a CC99C " C " 120 

ggagaagg . g — - — 9=t " tcac9a " ct9ac9 " tg 180 
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tcacgtgtgc ggcatgatca tcactgactt ccccggcccc aagggccagg cggtcgaaaa 24 0 
gcggggagtg aagaaatttt gttccaccgc cgaaatgctt ggttggtggc tgcagccgga 300 
aaaccgtctg ctcgatgcca agctctacgt ccacgacatg gggcgcagcg tttgggaaaa 360 
gccggatgac ggtcatctga tcgacgcaac cagcgcctac tatgtggtcg gtacgtcact 420 
caaaggcgcc atgggcgcgt cgcttgcaag ctttgccgag gagcaggacg ccaaggcgct 480 
tgccggcatg cacggcggtc gtgtgctgcg cttcgaggaa atcgatcagg cgctgctgca 540 
ggaggctgca agcatgcagc acggcggcat gcacgaccat gcgccaaacg gtgcacataa 600 
cgcacacgca ggccactgag cagcagtggt ctgaacagca cacacaagaa atcgaggtaa 660 
gcacaatgat gggtatcagc gtctggcaac tcctgatcat tcttctgatc gtcgtcatgc 720 

<210> 40 
<211> 127 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (9) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (101) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (119) 

<223> May be any nucleic acid 

gcggccgcnc ggcgctggct gctgtgcgga ggccacggcg ggccgcgagc cgcctcgtcc 60 
tcgccctcct gccctgggtg cggccccccg ggtcccggcg ncccactcgc cccggcgtnc 120 

127 

ccgcgct 



<210> 41 
<211> 6858 
<212> DNA 
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<213> Unknown 
<220> 

<223> Description of Unknown Organism : Unknown 
<400> 41 

actcgccaag tgatcgaccg gcccctgagg gccgcgacgc agagggcgcc ccgtgcactg 60 
gcacaggcgg ccttgtgcgt tagactctga tattcgtgcg ccctctcgtt ggcaggacca 120 
tccatcctgt gtgccggggg ccgcgcacac cgatcccgga tccgcctcgg ccctgccctg 180 
cgcgcccctc cgttctcgac ctccccgacg ctgtctgaac acgcgtcgcc gggggacgac 240 
ggcgggcggc ccgcctcggg ggaggggtaa gcgtcccggg atgcccgttc aaccgttccg 300 
caaggctcgc ccatcgtggg ggagaaccgg cgcgacgcta ggagagacaa gtgatccagc 360 
aggagtcgcg gctcaaggtc gccgacaaca ccggtgcgaa ggaaatcctg accatccgtg 420 
tgctcggcgg ttccggacgc cgctacgcag gcatcggcga caccatcgtc gccaccgtga 4 80 
aggacgccat ccccggcggc aacgtcaaga aggcgcacgt cgtcaaggcc gtggtggtcc 540 
gcacccgcaa gcagtcccgc cgtcccgacg gctcgtacat caagttcgac gagaacgcgg 600 
cggtcatcct gaagaccgac ggcgagcccc gtggcacgcg catcttcggc cccgtgggtc 660 
gcgagctgcg tgacaagaag ttcatgaaga tcgtgtcgct cgccccggag gtgatctgac 720 
ctcatggcca agatcaagaa ggacgacctc gtgcaggtca tcagtggcaa ggacaagggc 780 
aagcagggca aggtcctgcg cgtgttcccg acggatgagc gcgtgctcgt cgagggcgtg 840 
aaccgcgtga ccaagcacct gcgcgccggc caggacaaca acggttccac cgagggcggc 900 
ctgcaggtcg tcgaggcccc gatccacatc tcgaacgtgg ccgtggtgga cccggagacc 960 
aagaagccga cccgtgtggg ctaccgcttc gagaccgtcg agaaggacgg cgtgacgaag 1020 
accgtgaagg tccgcttcgc caaggcctcg gggaaggagc tgtgatgacc gaggtgcagc 1080 
agaccgagaa ggtcaccccg cgtctgaaga ccaagtaccg cgaggagatc cgcggacgcc 1140 
tgcaggagca gttccagtac gggaacgtca tgcaggtgcc gggcctcgtg aaggtcgtcg 1200 
tcaacatggg cgtcggcgag gccgccaagg actccaagat catcgacgac gccgtcaccg 12 60 
acctcaccgc catcaccggc cagaagccga tgatcaccaa ggcccgcaag tccatcgcgc 1320 
agttcaagct gcgtgagggc atgcccatcg gcacgcacgc caccctccgt ggcgatcgca 1380 
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tgtgggagtt cctggaccgc ctggtcacgc tgccgctgcc gcgcatccgt gacttccgcg 1440 
gcctgtccga ccgccagttc gacggcaacg gcaactacac cttcggcctg tccgagcaga 1500 
ccgtgttcca cgagatcgat caggacaaga tcgaccgcgt gcgcggcatg gacatcaccg 1560 
tggtgacgac cgccaagaac gacgacgagg gccgcgcgct gctcaaggcg ctgggcttcc 1620 
cgttcaagac cgaccagtaa gacctccacg ccacaggtcc tccaccggtg aaccggtggc 1680 
ggaaaccacg gcgagaaagg gcgtgaagca catgaccatg accgatcccg tcgcagacat 1740 
gctgacccgt ctgcgcaacg caaactcggc ctaccacgac accgtgtcca tgccgtcctc r 1800 
gaagctgaag actcgcgtcg ccgagatcct caaggccgag ggctacatcc aggactggcg 1860 
cgaggaggag gccgaggtcg gcaagaagct gaccatcgac ctgaagttcg gcccgcagcg 1920 
tgagcgtgcg atcgccggcc tgcgccgcat ctccaagccg ggcctgcgcg tgtacgcgaa 1980 
gtccacgaac ctgccccacg tgctgggcgg cctcggcatc gccatcctgt ccacctcctc 2040 
tggtctcctc acgaaccagc aggccgccaa gaaggctggc gtgggcggag aagtcctcgc 2100 
ctacgtctgg tgacgggcaa gacggaagaa aggctgaact gacatgtctc gaatcggacg 2160 
tctcccgatc accatccccg ccggcgtcga tgtgaccatc gacggcgacc gcgtctccgt 2220 
gaagggcccc aagggcccca agggtcagct cgagcactcg ctgcccacgc ccatcacggc 228 0 
caccctcgag gaggggcagg tcaccgtggc ccgccccgac gacgagcgtg agtcccgctc 234 0 
cctgcacggt ctgacccgta ccctcatcag caacatggtc gagggcgtga ccaacggctt 2400 
ctccaagcag ctcgaggtcg tcggcaccgg ctaccgcgtg caggccaagg gccaggacct 24 60 
cgagttcgac ctgggctact cccaccccgt cccggtgaag gtgtcccagg gcatcacctt 2520 
cacggtggag ggtaacaggg tcaccgtcgc cggtatcgac aagcagcagc aggtcggcga 2580 
gaccgccgcc aacatccgca agctgcgccg ccccgacccg tacaagggca agggcgtcta 264 0 
cgcgggcgag cagatccgcc gcaaggccgg aaagaagtga tgtctactct gaaggtgaag 2700 
ggcaagggca agttcaacgc ccgcacccgc cgccacctcc gggtgcgcaa gcggatctcc 2760 
ggcaccacgt ccgtcccccg cctcgtcgtc aaccgctctg cacggcacat gttcgtgcag 2820 
gtcgtggacg acacgcagag ccgcacgatc gcgtacgcct ccaccatgga ggccgacgtg 2880 
cgtgcgctcg agggtgacaa gacggccaag gccaagcgcg tgggcgagct cgtcgccgag 2940 
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cgtgccaagg cggccggcat cgaggccgcg gtcttcgacc gggcgggcaa caagtaccac 3000 
gggcgcgtcg cggccgtggc cgacggtgcg cgagagggtg ggctgcagct gtgaccgaga 3060 
acatcaacca gaaggacact caggtgaccg agagcaccga gaccaccgtc tccgagaccg 3120 
ggtcgggctc gcgagccaga ccaccgagcg cgccaccggt ggccgcggcg gtcgcgacgg 3180 
cggccgcggt ggccggacgg cgatcgtcgt ggcggccgtc ggacgaccga accgtcgtgg 3240 
cgcccaggac gacgaggaag gaccagttcc tcgagcgcgt cgtgggcatc aaccgcgtct 3300 
ccaaggtcgg ccgccgcttc tccttcaccg ccctcgtggt ggtgggtgac ggcgacggca 3360 
ccgtcggcgt cggctacggc aaggcgaagg aggtccccgc tgcgatccag aaggccgtgg 3420 
aggaggccaa gaagtccttc ttccgcgtcc cccgcgtcgg ctccaccatc ccgcacctgg 34 80 
tgcagggtga ggacgccgcc ggcgtcgtgc tgctccgccc ggcctccccg ggtaccgcgg 354-0 
tgatcgccgg cggtccggtg cgcgccgtgc tcgagtgcgc cggcatccac gacgtgctct 3600 
ccaagtccat gggctccgtg aacgcgatca acatcgtgcg cggcacggtg gagggcctca 3660 
agaagctgaa gagcccccag gccgtcgccg cccgccgcgg caaggccctg gacgagatcg 3720 
ccccccatgc gatgctgcgc accatggaga acgatcgcgc ccagaagagc gcgaaggcag 3780 
gtgcgtgacg cgtgtttgag tccactcgca agaacatcca gccctcggac gccaccctgg 38 4 0 
tcatcaccca gacccgcggc gtcacgggct ccaagcagaa ccatcgggac accctgcgct 3900 
cgctgggcct gaagcggatc ggccaccagg tcacccgcaa ggccgacgcg gtgacggtcg 3960 
gcatggtcaa caccgtgccg cacctggtgt ccgtggagga ' ggtcaacaat ggctgacaac 4020 
gacgccatca aggtccacga cctgcgtccg gcccccggtg ccaagaccgc caagacccgc 4080 
gtgggtcgcg gtgaggcgtc gaagggcaag accgccggtc gcggcaccaa gggcaccaag 414 0 
gcccgttacc aggtccgtgc gggcttcgag ggcggtcagc tgcccctgca gatgcgtctg 4200 
ccgaagctcc gcggcttcaa gaacccgttc cgcacggagt accaggtcgt gaacctggac 4260 
aagctctccg cgcacttccc cgagggcggt gaggtcaccg tggacgcgct cgtctccaag 4320 
ggcctcgtcc gtcgtggcca gcccgtgaag gtgctgggca cgggggagat caccgcggcc 4380 
gtgcaggtga aggcgaacgc cttctctgcg tccgccgtgg agaagatcca ggccgccggc 4 4 40 
gggtccaccg agaccctctg acacgccgac ccatcgaccg agggccctgg ccggagcagc 4500 
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cgctcgggcc aggccctggt ccgtccgtgt agactcgcac agccgccccg gtgtggccgc 4 5 60 
cgtctcgtgc ccccgccccg cggaacggcg cacgccccac aggaccagcc gcaggaggac 4 620 
tcgtgctcaa ggccatcgcc cggatcgtcc ggacgcctga cctgttgcgg aagatcgcct 4680 
tcacgctcgg gctcatcgcc gtctatcgga tgggcgactt cgtgccggcc accggcgtgg 474 0 
actacccggc ggtgcagcag tgcctggcag cgggcaacgc ccagggcggc ctgtactcct 4800 
tcgtgaacat gttctcgggc ggggcgctcc tgcaggtgtc tgtcttcgcg ctgggcatca 4860 
tgccgtacat cacggcgtcg atcatcgtgc agctgctgcg cgtggtgatc ccgcgcttcg 4 920 
agcagctcca ccaggagcgc cgcaggggcc aggcgacgct gacgcagtac acccgctacc 4 980 
tgaccctcgc cctcgccctg ctgcaggcga ccacgatggc ctcgctggcc cgcaccgggg 504 0 
ccctgctcgg atgcagcctg ccgctgctgc gcgacggctc catcctcacg gtgctgctcg 5100 
tggtcatcgc cctgaccacc ggctgtctca tcgtcatgtg gttcggggag cggatcaccg 5160 
agaacggcgt gggcaacggc atgtccctgc tcatcttcac ctccatcgcg gcaggcttcc 5220 
cggccggtct cggccaggtg gtccagacgc agggctggcg cgtgttcgcg atcgtcatgg 5280 
ggatcggcct gctcaccatg ctggccatcg tcttcgtgga ggagtcgcag cgccggatcc 534 0 
cggtccagta cgccaagcgg cagatcggct cacggaccgt gggcgggtcg agcacctaca 54 00 
tcccggtcaa ggtgaacatg gccaacgtca tcccggtcat cttcgcctcc tccgtgctga 54 60 
tgctcccggg catcctcatc cagttcaaca cgccgcagga cggcagtgcg ccggccccgt 5520 
ggatcacgtg gctgagccgg tacttcggct ccggtgacca cccggtgtac atggccctgt 5580 
acttcctgct catcatcggc ttcacgtact tctacgtgtc catcacgttc aacccggtgg 5640 
agatctcgga caacatgaag cgctacggcg gcttcatccc ggcgtccgcg ccggccggcc 57 00 
ccaccgagcg ttacctgcag tacgtcatca gccgcatcac gttcgtggtg ggggccctct 57 60 
acctcggtat cgtggccatg atcccgctga tcgccttcgc ggtgatcggc accagccaga 5820 
acttcccgct cggcggcacg tccatcctca tcatggtggg cgtcggcctc cagaccgtga 5880 
agcaggtcag cgcacagatg gagcagcgcc actacgaggg cctgctgcgc tgagccccga 5940 
cccgatcccg caacgccgtc cgtatcgaca gtgaggaaca cacgatgacc cgcatgctgc 6000 
tcatgggccc tcccggttcc ggcaagggca cccaggccac ccggatcgcc gacaagctgg 6060 
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ggatcgtccc gatctccacc ggtgacatct tccgccacaa cgtgaagtcg atgacgccgc 6120 

tcggcgtcga ggccaagagg tacatcgaca acggcgactt cgtccccgat gaggtcacga 6180 

accgcatggt cgccgaccgc atcgcccagg ccgacgcgga gcacggcttc ctgctggacg 6240 

-gctacccgcg cacgaagggc caggtcgagg cgctggacgc catgctcgcc gaggccggcc 6300 

agtcgctgtc cgccgtcgtc gagctggagg tgcccgacga ggagctcgtg gagcgcctgc 6360 

tcaagcgtgc cgagatcgag ggccgcgcgg acgacaccca ggaggtcatc gagcaccgcc 6420 

tggacctgta ccaccgcgag accgagtccg tcatccagga gtacgtggag cgcggcatcg 6480 

tcgcccgcgt ggacggcacc ggccagatcg acgacgtcac cgagcgcctg ctgcaggccg 654 0 

tgtactccgt gcgctccgcc acgggctccc tgcccgtgat ccagccgggc gcggagtcct 6600 

gaccccgtga tcggccgccg ctcgctcgag ctcaagaccg ccccccagct gctggccatg 6660 

cagcgcgcgg gggtggtcct gtccgaggca ctggacgccg cgctggccgg cgcgccgggc 6720 

ttcaccaccg cggagctgga cgccgtgttc gcggtggtgc tggccgaacg cggtgcgacc 67 80 

tccaacttcc tgggctacta cgacttcccg gcctcgatct gcacctcggt caacgaggag 6840 

gtcgtgcacg gcatcccc 6858 



<210> 42 

<211> 578 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (5) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (23) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (31) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (48) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (211) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (292) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (308) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (350) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (384) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (477) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (507) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (529) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (549) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (551) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (558) 

<223> May be any nucleic acid 
<400> 42 

ttctngtcta tggcagagat ggncaggttg ncgttgagca ggtactgncc atcagccgtc 60 
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ttcagcgcca ggtagttccc atcgttctgc acacccgggt ggctccgctg cttcacgtca 120 
atattagtgg caccagctgg gatggtgaca atgtcattgt agccataatt ggtgggggtg 180 
agggacccgg agaccttcct gcaggagttg nctttgcccc cacacacccc gcatttgtcc 24 0 
agcttccgag gcgagtccac cacatggtca cagccggcct tgacacactg gncacggaca 300 
cagatggnca gtgtttctgg cccacacagg gtgccatcaa tcaccttggn ctcgaacact 360 
ttggaactcg ctcctccccc gggntcggga ggaacaactt gcaggggtcc cgggggggac 420 
aacccagcat tcttggggga cccactgcag gaggattccc cgtccatgtc aagtgtnatt 4 80 
ggtgggcatt attcttctca caattgntgc tccctgaagg ttttcccgnc aaggggggat 54 0 
tcccccccng ntggaatnat tggtacttgg gtctccga 578 

<210> 43 
<211> 305 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (128) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (146) 

<223> May be any nucleic acid 
<400> 43 

catttaagtt tgctagtcct ttgcaaacag actgacgctg agtgtcctgt ctgagtcaat 60 
aagtgcactt ttacctttta acctatgccc tctacttgaa cccgagcaag gtccagtcca 120 
ctggacangt tgatgatagg gtctgncgcc ccataccctc tcctcttccc ccttaggaat 180 
ttgtgcagta ctggaggggt tgcggcaatg ggaggcctgg gtgggccgtg ctgccttgat 24 0 
atggccaagg gacccagtca ccacagtgga gacccttgtc tgcacctcag taccgcatgt 300 
ccagg 



<210> 44 

<211> 333 

<212> DNA 

<213> Homo sapiens 



305 
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<220> 

<221> UNSURE 
<222> (82) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (255) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (275) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (299) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (313) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (324) 

<223> May be any nucleic acid 



<400> 44 
ggcacaggtg 


actttagcat 


gcagagcagc 


aaagagagag 


caaccaccaa 


catcatccag 


60 


ccgctgctcc 


acgcacagtg 


gntgctgggg 


gactggtctg 


agtgctctag 


cactgegggg 


120 


ccggctggca 


gaggcgaact 


gtagagtgca 


gggacccctc 


eggtgeagge 


ctctgccacc 


180 


tgcaacaagg 


ctctggaaac 


ccgaggatgc 


caagccctgg 


cagaaccagc 


tgtgccccct 


240 


gtgatttcag 


ggggncaggg 


gccattttgt 


gctcngggac 


atgcggtaat 


ggaggttgnc 


300 


agacaaggtc 


ttncattgtg 


gtgnatgggt 


tec 






333 



<210> 45 
<211> 102 
<212> DNA 
<213> Unknown 



<220> 

<223> Description of Unknown Organism: Unknown 



<400> 45 
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<220> 

<221> UNSURE 
<222> (64) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (69) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (71) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (72) 

<223> May be any nucleic acid 

gcagcagcag cgcagcgcag agagagcagc agcagcagca gcagcagcag cagagcagat 60 
cntnctggna nnaaaaaatc gcggcagcag ctgctctagc ag 102 



<210> 46 
<211> 123 
<212> DNA 
<213> Unknown 



<220> 

<223> Description of Unknown Organism : Unknown 
<220> 

<221> UNSURE 
<222> (9) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (51) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (52) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (57) 

<223> May be any nucleic acid 



<220> 
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<221> UNSURE 
<222> (67) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (123) 

<223> May be any nucleic acid 
<400> 46 

caggcaagnc ggcacgtagg agcagcagca gcagcagcag cagcagtaac nnagtcnacg 60 
agggggngcc cgggacccaa ggcgcccgaa cagagaggcg gagcacaatc cactggtcgg 120 
cgn 123 



<210> 47 
<211> 109 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (87) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (95) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (102) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (106) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (107) 

<223> May be any nucleic acid 
<400> 47 

ggcacgcagg agcagcagca gcagcagcag cagcagcagc agagagagag cagcagagag 60 



agagagcagc agagcagagc agagcanagt agagnagagc anagcnnac 



109 
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<210> 4B 

<211> 293 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (86) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (166) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (185) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (209) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (214) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (219) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (234) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (290) 

<223> May be any nucleic acid 
<400> 48 

ggcacgaggg ggaaactgct ccgcgcgcgc cggggaggag gaaccgcccg gtcctttagg 60 
gtccgggccc ggccgggcat ggattnaatg cctg'agcccg ggtcccgctg tcttctgctt 120 
cttcccttgc tgctgctgct gctgctgctg ctgccggccc cggagntggg cccgagccag 180 



gccgnagctg aggagaacga cttgggttng cctncccana aaatgggaag gganttgggg 240 
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ttaatcgaag tcattgggac cattttaaaa ggggcttcct ggattatagn ctt 293 



<210> 49 
<211> 506 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (283) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (342) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (356) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (362) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (364) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (368) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (429) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (454) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (461) 

<223> May be any nucleic acid 



<400> 49 

aattcggcac gagcacccgg ccactgcagt cttctgccct gctggacagc agcagcagca 60 
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gcagcagcag cagcagcagc agcagcaaca gtaacagcag cagttcgtcc ggacccaacc 120 

cttctacctc ctttgagccc atcaaggcag accccacagg tgttttggaa ctccccaaag 180 

agctgtcaga aatctttgat cccacacgag agtgcatgag ctcggagctg ctggaggagt 240 

tgatgtcctc agaagtgttt gcccctctgc tttcgtcttt ctncaccccc gggagaccac 300 

gattatatct acaacctgga cgagagtgaa ggtgtttgtg anctcttttg atgtgnctgt 360 

tntnaacntt tgactgacag ggacatgcct tttttggttg ggacccagat tttttgactt 420 

gggggtttnc ttgggacttt tcaaccgacc ctanagagtt nagagcaaan aggttggttt 480 

ttcggcttcc ttaacgaaag ttttgg 506 



<210> 50 
<211> 419 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (137) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (221) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (259) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (327) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (385) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (389) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (416) 
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<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (418) 

<223> May be any nucleic acid 



<40O> 50 

tttaagcacc aaaacttgtg ttttaatgat gttggatgga aatctttcct aaatgtgtca 60 

tgcatgctct tgtctccctt aatggagaga gtgtgacact gcttagcact tggatggctt 120 

ggggtggtgg ttatgancag cagtctgtca cagctcagcg aggtgaagcc tgtgggcgtt 180 

ttgctctgtg ctgaatggct cagtggccct acaaagcgga ntcagctctt ggtggctttc 240 

tgttgtggtg ggctgctgnt gctgctgctg ctgctgctgc tgctgccctt gcctctaaaa 300 

gaactcactt cctcttcctc ctgctgncac ctgtcttttg gcttgtggga ttggagtcat 360 

ggggcccaga tggagccttg ctccntgant tatgataggc ccctcggtct cttttntnc 4 19 



<210> 51 
<211> 495 
<212> DNA 

<213> Saccharomyces cerevisiae 
<220> 

<221> UNSURE 
<222> (177) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (322) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (328) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (342) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (368) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (371) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (375) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (380) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (386) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (396) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (404) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (423) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (426) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (436) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (443) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (456) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (460) 



WO 99/37660 



-194- 



PCT/US99/01313 



<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (467) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (468) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (471) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (474) 

<223> May be any nucleic acid 



<400> 51 

aattcggcac gagcaaagtt ctgcgctcca ttgtgggcat caaacgacac gtcaaagccc 60 

tccatctggg ggacacagtg gactctgatc agttcaagcg ggaggaggat ttctactaca 120 

cagaggtgca gctgaaggag gaatctgctg ctgctgctgc tgctgctgcc gcagacnccc 180 

agtccctggg actcccacct ccgagccagc tcccaccccc agcatgactg gcctgcctct 240 

gtctgctctt ccaccacctc ttgcacaaag cccagtcctc cggcccagaa catcctgggc 300 

ccggagttcc ttccttgcct tnaggggntt ttcagcaagt tnagttcctt gggtcctttt 360 

tgggaaantt naggnagttn aaggantacc aggttnttgc catnctttcc agatccaagt 420 

ttnacnaaaa attttnaaca gtntaaattg ggtttnttgn ccctttnngg nggntgtttt 4 80 

ttttttcggg tccgg 4 95 



<210> 52 
<211> 81 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (65) 
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<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (67) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (71) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (75) 

<223> May be any nucleic acid 
<400> 52 

ggcacgcagg agcagagcag cagcagcaga gagagcagca gcagcagcag cagcagcaga 60 
gagananata natanatata t 81 



<210> 53 

<211> 305 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> (11) 
<223> May be 

<220> 

<221> UNSURE 
<222> (62) 
<223> May be 

<220> 

<221> UNSURE 
<222> (81) 
<223> May be 

<220> 

<221> UNSURE 
<222> (256) 
<223> May be 

<220> 

<221> UNSURE 
<222> (289) 
<223> May be 



any nucleic acid 



any nucleic acid 



any nucleic acid 



any nucleic acid 



any nucleic acid 



<400> 53 

aggcacttga nttgaaaatg gaaaacccta ctgctggtgg tgctgcggtg atgaggccta 60 
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tnatgcagcc ccagggtttt nttaatgctc 
taagtcatca cttccgacaa cagagggtgg 
aacagcagca gcagcagcag cagcagcagc 
agcagcaaac ccaggncttc agcccacctc 
ggctt 



aaatggtcgc ccaacgcagc agagagctgc 120 
ctataatgat gcagcagcag cagcagcagc 180 
aacagcaaca gcaacagcaa cagcagcaac 240 
ctaatgtgac tgcttcccnc agcatggatg 300 

305 



<210> 54 
<211> 307 
<212> DNA 

<213> Hepatitis C virus 
<220> 

<221> UNSURE 
<222> (212) 

<223> May be any nucleic acid 



<400> 54 

tggggtgtga agctccggtg 
ccccctcagc agtctttctg 
gaggtctgga agataaccga 
ctcatcttgt gacccgaagc 
aaaatacctt caaacggtag 
cagcagc 



ctggtgcggc gggggactgc 
tcgttgccct ccacactgcg 
ttcctgggag atttgggggt 
cggcggcctt gncaggagta 
cagcagcagc agcagcagca 



ggggccagcc tcagtttaaa 60 
agactctgga gggcgatctg 120 
agtctccaat ctgtccctgg 180 
ttctagaatg agtgcacata 240 
gcagcaagca gcagcagcag 300 

307 



<210> 55 
<211> 88 
<212> DNA 
<213> Unknown 

<220> 

<221> UNSURE 
<222> (6) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (7) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
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<222> (78) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (83) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (87) 

<223> May be any nucleic acid 



<220> 

<223> Description of Unknown Organism : Unknown 
<400> 55 

ggacanngac tactctctct ctctctctct ctctctctgc tgctgctgct gtgctgctgc 60 
tgctgctgct gctgccgntg tgngcana 88 



<210> 56 
<211> 346 
<212> DNA 
<213> Unknown 

<220> 

<221> UNSURE 
<222> (278) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (288) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (299) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (313) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (342) 

<223> May be any nucleic acid 
<220> 

<223> Description of Unknown Organism: Unknown 
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<400> 56 

ggcacagccc aactggtgat gctgctgctg ctgctgctgc tgccgccgcc gcctctattg 60 
ctgatactct agtggggctg gaagggtggt tcctattcgc accatcgcca accagagaca 120 
gagggaaaaa aaaaaccggc agccactgct gaatgttggg ttcggaggct gcatccgact 180 
cggtcacaag gaaaatggat tcagtttgca tctctccctc ctttaaacag cttctccggg 240 
tctcagcatg ggcttccagg gcagcgattg aggagacntt accaaggngc accacacant 300 
agatgctgag acntcgtgac tccaggataa gaaacattaa cngggg 34 6 



<210> 57 
<211> 496 
<212> DNA 
<213> Unknown 



<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (11) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (78) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (195) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (197) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (286) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (291) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (293) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (31S) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (328) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (329) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (344) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (346) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (352) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (354) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (358) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (366) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (399) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (406) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (410) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (418) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (420) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (435) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (443) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (453) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (454) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (459) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (471) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (473) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (474) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (481) 

<223> May be any nucleic acid 
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<400> 57 

gaattcggca naggtgcaca gatgtggtgg atggggaggg ccgcacggga cagaagttct 60 
ccctgtgtat tctgacgnct gagaaaggag catttcatcc gggcggagac caaggagatc 120 
gtcaatgggt ggctggagat gctcatggtc tatccccgga ccaacaagca gaatcagaag 180 
aagaaacgga aagtngnagc cccccacacc acaggagcct gggactgcca agttgggctg 24 0 
ttaccagcag cagcagcagc agcagcagca gcagcagcat ccccantgct ntnggaaagt 300 
tcccaccacc aagtnccaca atttgggnna aaaccaaggt tgtngnagac gngntttngg 360 
gatttnggca ttgtgggttg cttgcatgga aggacattng gttgtnggtn ccttggangn 420 
tacaattacc atttncggtt gtnaaggtta aanntccgnc attcagaagg ntnnaaggtg 480 
ntttgaagtc catttg 496 

<210> 58 
<211> 268 
<212> DNA 

<213> Drosophila sp , 
<220> 

<221> UNSURE 
<222> (16) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (51) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (60) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (202) 

<223> May be any nucleic acid 
<400> 58 

aacacttatc cttganagct ctgtttggga agcaggacaa agctacatgt naggaaactn 60 
tggagcctcc gcagactctc caccagcagc agcagcagca gcagcagcag caagagaagc 120 
ttccaattag gcagggggtt gtacgctccc tgtcctatga ggaacccaga agacactcac 180 
cccccattga gaagcagctc tntccagcca ttcagaaact catggtcagg agcgcagacc 24 0 
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tccacccatt gtcagagctg cctgaaaa 268 



<210> 59 

<211> 471 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (249) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (386) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (449) 

<223> May be any nucleic acid 
<400> 59 

tcgacccacg cgtccgctga ggaacagacg ttccctggcg gccctggcgc cttcaaaccc 60 
agacatgctg ctgctgctgc tgctgctgcc cctgctctgg gggacaaagg ggatggaggg 120 
agacagacaa tatggggatg gttacttgct gcaagtgcag gagctggtga cggtgcagga 180 
gggcctgtgt gtccatgtgc cctgctcctt ctcctacccc caggatggct ggactgactc 240 
tgacccagnt catggctact ggttccgggc aggagacaga ccataccaag acgctccagt 300 
ggccacaaac aacccagaca gagaagtgca ggcagagacc cagggccgat tccaactcct 360 
tggggacatt tggagcaacg actgcnccct gagcatcaga gacc/ccagga agagggataa 420 
ggggtcatat ttctttcggc tagagagang aagcatgaaa tggagttaca a 471 



<210> 60 
<211> 379 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism : Unknown 
<220> 

<221> UNSURE 
<222> (2) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (14) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (31) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (135) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (315) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (332) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (349) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (357) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (374) 

<223> May be any nucleic acid 



<400> 60 

anttcggcan aggnaaggga gagggtgacc ngcatcccaa ctagatttca gtggagtgaa 60 

gttcaggagg catggagctg acaaccatga ggcctcggca gccaccgcca ccaccgccgc 120 

cgccaccacc gtagncagca gcagcagcag cagcagcagc aagagttaac tctgacttag 180 

ggaatagaga cagccagaga gaaatgtgat caatgaagga gacatctgga gtgtgcgtgc 240 

ttcttcagag gggacgggtg atgggcagat ttggaaaaag caccgcagat tgggaacctt 300 

atcttttctt tttcntaaaa ttgttgttat gnaaatttgg gtttttccng taacttntta 360 
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aaaacttaaa agtnggttt 379 



<210> 61 
<211> 255 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism : Unknown 
<220> 

<221> UNSURE 
<222> (121) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (183) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (254) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (255) 

<223> May be any nucleic acid 



<400> 61 

aattccgaca atggaaagca ctcttagcct tgcagtggtc tacattttta aggaaccaat 60 
atttcagcat tctttattac ccggcacgct gtgtcctttg tcagagttca agtttatggt 120 
nactgccagg gtcagacagt ccatttgctg ctgctgctgc tgctgctgct ttctcgaact 180 
ggnatggcat tagggaagct gctgtctgag tgttagggaa tgtcttggct aagtaaagcc 240 
aatgttcttt cctnn 255 



<210> 62 
<211> 5289 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 



<400> 62 

cgagctctcc cagccgcagc ctccgaatcc acggcctcca ccccgcgcct ctccagcgct 60 
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ctatcccgtc gctgcgccct tgtcgccggc cccggccgct gcatccgcgt ccgcacaggc 120 
tccttgctgg gcacaaatag ctccaccatg gggctggcct ggggactcgg tgtcctgctc 180 
ctgttgcatg cctgcggctc caaccgcatt ccagagtctg ggggagacaa cagtgtgttt 240 
gacatctttg aactcaccgg agctgcccgc aagcggtctg ggcgccgact ggtgaagggc 300 
cctgaccctt ctagcccagc tttccgcatc gaggatgcca acctgatccc ccctgtgcct 360 
gacaagaagt tccaagacct agtggatgct gtgcgggcgg agaaaggttt cctcctcctg 420 
gcctccctga ggcaaatgaa gaagacccgg ggtaccctgc tggctgtgga gcggaaagac 480 
cactctggcc aggtcttcag cgtgatct cc aatggcaagg cgggcaccct ggacctgagc 54 0 
ctgaccgtgc aggggaagca gcatgtggtg tcggtggaag aagcactcct ggcgactggc 600 
cagtggaaga gcatcaccct gtttgtgcag gaggacaggg cccagctgta catcgactgt 660 
gagaagatgg agaatgcgga gctggatgtc cccatccaga gcatcttcac cagggacctg 720 
gccagcatcg ccaggctccg cattgccaaa ggaggtgtca acgacaattt ccagggggtg 780 
ctgcagaatg taaggtttgt ctttggaacc acaccagaag acatcctcag gaacaaaggc 840 
tgctccagct ctaccagtgt ctttgtcacc cttgacaaca acgtggtgaa tgggtccagc 900 
cctgccatcc gcaccgacta cattggccac aagacaaagg acctgcaagc catctgtggc 960 
atctcatgtg acgagctgtc cagcatggtc ctggagctca ggggtctacg caccatcgtg 1020 
accacgctgc aggacagtat ccgcaaagtg accgaagaga acaaagagct ggccaacgag 1080 
ctgaggaggc ccccactctg ctaccacaac ggagtgcagt acaggactgg cgacgagtgg 1140 
acggtggaca gctgcactga gtgtcgctgc cagaactcag ttaccatctg caaaaaagtg 1200 
tcctgtccca tcatgccctg ctccaatgcc acagttccgg atggagaatg ctgcccacgg 1260 
tgctggccca gcgactctgc agacgatggc tggtccccgt ggtctgagtg gacctcttgc 1320 
tctgtgacct gtggcaatgg aatccagcag cgtggccgct cctgcgacag cctcaacaac 1380 
agatgcgagg gctcctctgt gcagacgegg acctgccaca tccaggagtg tgacaagaga 14 4 0 
tttaaacagg atggcggctg gagccactgg tccccatggt catcttgctc cgtaacatgt 1500 
ggagacggtg tgatcacaag gatccggctc tgcaactccc ccagccccca gatgaatggg 1560 
aagccatgtg agggcaaagc ccgggagacc aaagcctgcc agaaagactc ctgccccatc 1620 
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aatggaggct ggggaccttg gtcaccatgg gacatctgtt ctgtcacctg tggaggaggg 1680 
gtacagaaac gtagccggct ctgcaacaac cccaaacccc agtttggagg caaggactgc 174 0 
gttggtgatg tgacagaaaa ccagatctgc aacaagcagg actgtcccat tgacggatgc 1800 
ctgtccaatc cctgctttgc tggtgtccag tgtaccagct accctgatgg cagctggaag 18 60 
tgtggtgcct gtcccccagg ctatagtgga gatggagtcg agtgcaaaga cgttgatgag 1920 
tgcaaagaag tccctgatgc ctgcttcaac cacaatggag agcacaggtg tgagaacaca 1980 
gaccccggct acaactgcct gccctgccca ccgcgcttca ctggctcgca gccctttggc 204 0 
cggggcgtgg aacatgccac cgccaacaag caggtatgca agccccgaaa cccctgcaca 2100 
gacgggacac acgactgcaa caagaacgcc aagtgcaact acctgggcca ctacagcgac 2160 
cccatgtacc gctgcgagtg caagcctggc tacgccggca acggcatcat ctgcggggag 2220 
gacacagacc tggacggctg gcccaatgag gacctgctgt gcgtggccaa cgcaacttac 2280 
cactgcagaa aggataattg ccccaacctt cccaactcag ggcaggaaga ctatgacaag 2340 
gatggaatcg gcgatgcctg cgatgatgac gatgacaatg ataagattcc agatgacagg 24 00 
gacaactgtc cattccatta caacccagcc cagtacgact atgacagaga tgacgtggga 24 60 
gaccgctgtg acaactgccc ctacaaccac aacccagacc aggctgacac agataacaat 2520 
ggggaaggag acgcctgtgc agctgacatt gatggggaca gtatcctcaa tgaacgggac 2580 
aactgccagt atgtctacaa tgtggaccag aaagacactg acatggacgg ggttggtgat 2640 
cagtgtgaca actgccccct ggaacacaat ccagaccagc tcgactctga ctcggaccgc 2700 
attggagaca cctgtgacaa caatcaggat attgatgaag acggccacca gaacaatctg 2760 
gacaactgtc cctacgtgcc caacgccaac caggctgacc atgacaagga tggcaaaggc 2820 
gatgcctgtg accatgatga cgacaatgat ggcattcctg atgaccggga caactgcagg 2880 
ctggtgccca atcctgacca gaaggactct gatggtgatg gtcgaggtga tgcttgcaaa 2940 
gatgattttg accaggacaa ggtgccagac attgatgaca tctgtcccga aaatgttgat 3000 
atcagtgaga ctgatttccg ccgattccag atgattcctc tagatcccaa agggacatcc 3060 
cagaatgacc ctaactgggt tgtacgccat cagggtaaag aactcgtcca gactgtcaac 3120 
tgtgaccctg gacttgctgt aggttatgac gaatttaacg ccgtggactt cagtggcacc 3180 
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ttcttcatca acaccgagag ggatgacgac tatgccggct ttgtgtttgg ctaccagtcc 3240 

agcagccgct tctatgttgt gatgtggaag caagtcactc agtcctactg ggacaccaac 3300 

cccacgaggg ctcaggggta ctctggactt tccgtgaagg ttgt'aaactc caccacgggg 3360 

cctggcgagc acctgcggaa tgccctgtgg cacacaggaa acacctctgg ccaggtgcgc 34 20 

acactgtggc atgaccctcg tcacattggc tggaaagatt tcactgccta cagatggcat 34 80 

ctgagccaca ggccaaagac aggtttcatc agagtggtaa tgtatgaagg gaagaaaatc 3540 

atggctgact caggacccat ctatgacaaa acctatgctg gtgggaggct aggcttgttc 3600 

gtcttctctc aagaaatggt gttcttctcc gacctgaaat atgaatgcag agactcctaa 3660 

tcatcaaact gttgatcaaa agactgatca taaaccaatg ctggtattgc accttctgga 3720 

accatgggct tagaaaaccc ccaggatcgc gcctcgctgc ctgcctttgc tctctgcttg 3780 

catgagtgtg gactcctaga acatgtgact tgcctcaaga aaatgcaatt ttccaaatca 3840 

gaccctgcat tcagcctctg actgagaaga atcttccaag gagacaaaca atgactttgg 3900 

ttggcttttg caaaagcaaa agcatccaca tgctttggtt ggaaggtgcc tgtcccactc 3960 

tgcttttgtc agagcagaat gcgactgtga ggccagctct gagcagtgga ctccaaaatg 4 020 

ttttcaggca tgtgagagaa gggaggactc actagaattg acaaacaaaa ccagccctga 4080 

cctactccct ctggaatggg ggcgggtggg ggggccaaag cccaaagggg aggatgcata 4140 

cccaagagat gattgtatga agaaaatatg gaggaactgt tacatttttg gtactaaatc 4200 

attttcaggg gattgaaaga ctattgctgg atttcatgat gctgaccggt gttagctgat 4260 

taacccacat aaataggcac ttaaatagga gcagggaagg aaggaaaaga ctggcttctg 4 320 

gacttcctcc cagatttcca ccccttaaca ^catcacctgt agtgaccaga acagggagtc 4 380 

ggagttaaac cgacacaagg cagggccagc tgctgcagct tggttctatt gaaattgtca 4440 

gttgtattcc agatgtagct tctgcagatg tagcagcaaa ataagaatac ccaccatctc 4 500 

agcgagcacc aggctgtctc ccaagggacg gcagccatgc ttgtattttt atggttagaa 4 560 

aggcacaaaa ttatcaacta agacattcct tctttctctt tttttcctga acatcatgga 4 620 

gttttccagt tgtctctttt ggactgtagt ttttagtgtt ttaaacaaac actttacaat 4680 

gtaaactatt tattttttac ttattctggg ggatctgtct gaaagactat tcatggaaca 4740 
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ggaagaagcg taaggactat ccatatcatc tttgctacaa gtcattatga ctgtaagatt 4800 
gtaaatacag attatttatt aactctgttc tacctggaat ctagtttcat atggaaagtg 4860 
tttgagagca ggtagttgag atcgatcagc aaatctttca caggaatggc acaaggaaac 4 920 
cagcatagca agctgctctt caccttgtgc ttagactgga tgatttggaa ttcttttttc 4 980 
cttttttttc ccaagtggaa ttacttggtt gtccatttgc aagtgttttt agtttgcaaa 5040 
gaaagccaag aggccattaa tactgtctta tcccatccct tgtgcctatt tccagggaga 5100 
tgaaaagcat ctacatttat tatttttgcc tttttccaaa agaaaaaaat gacaaaggtg 5160 
aaacttgtat acaaatatta cctcatttgt tgtgtgactg agtaaagaat tttgggatca 5220 
aacagaaaga gtttaagtgt ctaacaaact taaagctact gtagtaccta aaaaaaaaaa 5280 
aaaaaaaaa 5289 

<210> 63 
<211> 2053 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 63 

gaattccggc ggccgctgag agcccaccct ggcgagctct cccagccgca gcctccgaat 60 
ccacggcctc caccccgcgc ctctccagcg ctctatcccg tcgctgcgcc cttgtcgccg 120 

gccccggcgc tgcatccgcg tccgcacagg ctccttgact gggcacaaat agctccacca 180 

tggggctggc ctggggactc ggtgtcctgc tcctgttgca tgcctgcggc tccaaccgca 24 0 

ttccagagtc tgggggagac aacagtgtgt ttgacatctt tgaactcacc ggagctgccc 300 

gcaacggtac tgggcgccga ctggtgaagcf gccctgaccc ttctagccca gctttccgca 360 

tcgaggatgc caacctgatc ccccctgtgc ctgacaagaa gttccaagac ctagtggatg 420 

ctgtgcgggc ggagaaaggt ttcctcctcc tggcctccct gaggcaaatg aagaagaccc 4 80 

ggggtaccct gctggctgtg gagcggaaag accactctgg ccaggtcttc agcgtgatct 54 0 

ccaatggcaa ggcgggcacc ctggacctga gcctgaccgt gcaggggaag cagcatctgg 600 

tgtcggtgga agaagcactc ctggcgactg gccagtggaa gagcatcacc ctgtttgtgc 660 
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aggaggacag ggcccagctg tacatcgact gtgagaagat ggagaatgcg gagctggatg 720 
tccccatcca gagcatcttc accagggacc tggccagcat cgccaggctc cgcattgcca 780 
aaggaggtgt caacgacaat ttccaggggg tcctgcagaa tgtaaggttt gtctttggaa 84 0 
ccacaccaga agacatcctc aggaacaaag gctgctccag ctctaccagt gtctttgtca 900 
cccttgacaa caacgtggtg aatgggtcca gccctgccat ccgcaccgac tacattggcc 960 
acaagacaaa ggacctgcaa gccatctgtg gcatctcatg tgacgagctg tccagcatgg 1020 
tcctggagct caggggtcta cgcaccatcg tgaccacgct gcaggacagt atccgcaaag 1080 
tgaccgaaga gaacaaagag ctggccaacg agctgaggag gcccccactc tgctaccaca 114 0 
acggagtgca gtacaggact ggcgacgagt ggacggtgga cagctgcact gagtgtcgct 1200 
gccagaactc agttaccatc tgcaaaaaag tgtcctgtcc catcatgccc tgctccaatg 1260 
ccacagttcc ggatggagaa tgctgcccac ggtgctggcc cagcgactct gcagacgacg 1320 
gctggtcccc gtggtctgag tggacctctt gctctgtgac ctgtggcaat ggaatccagc 1380 
agctggccgc tcctgcgaca gcctcaacaa cagatgcgag ggctcctctg tgcagacgcg 14 4 0 
gacctgccac atccaggagt gtgacaagag atttaaacag gatggcggct ggagccactg 1500 
gtccccatgg tcatcttgct ccgtaacatg tggagacggt gtgatcacaa ggatccggct 1560 
ctgcaactcc cccagccccc agatgaatgg gaagccatgt gagggcaaag cccgggagac 1620 
caaagcctgc cagaaagact cctgccccat caatggaggc tggggacctt ggtcaccatg 1680 
ggacatctgt tctgtcacct gtggaggagg ggtacagaaa cgtagccggc tctgcaacaa 17 4 0 
ccccacaccc cagtttggag gcaaggactg cattggtgat gtgacagaaa accagatctg 1800 
caacaagcag gactgtccca ttgacggatg cctgtccaat ccctgctttg ctggtgtcca 1860 
gtgtaccagc taccctgatg gcagctggaa gtgtggtgcc tgtcccccag gctatagtgg 1920 
agatggagtc gagtgcaaag acgttgatga gtgcaaagaa gtccctgatg cctgcttcaa 1980 
ccacaatgga gagcacaggt gtgagaacac agaccccggc tacaactgcc tgccctgccc 204 0 
accgcccgga att 2053 



<210> 64 
<211> 4339 
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<212> DMA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 64 

agccactgcc tggagtcagc cagcctcatc ggacttctgc aggcaatcgc gaagctgcta 60 
tccagttctg ccacggtctc tcccggcgca ccggcagtct cagcgtcttc accggactca 120 
gcgtccttgt ccttcacttc acctttgcca cctctccggg ttactgagcc ccggtgcaca 180 
caggctccgt gttgggcaca aaggctccac catggagctc ctgcggggac taggtgtcct 24 0 
gttcctgttg catatgtgtg gaagcaaccg cattccagag tctgggggag ataacggtgt 300 
gtttgacatc tttgaactca ttggaggtgc acgaaggggc cccggtcgcc gactggtgaa 360 
gggccaagat ctatccagcc ccgccttccg cattgagaat gccaacctga tccccgctgt 420 
gccggatgac aagttccaag acctactgga cgctgtgtgg gccgacaaag gcttcatctt 480 
cttggcttcc ttgaggcaga tgaagaagac ccggggcaca ctcctggctg tggaacggaa 540 
agacaacact ggccagatct tcagtgtggt ctccaacggc aaagctggca ccctggacct 600 
gagcctgagc ctgccaggga agcaacaagt ggtgtcagtg gaggaagctc tcctggccac 660 
tggccagtgg aagagcatca cgctgtttgt tcaagaggac cgggctcaac tctacataga 720 
ctgtgataag atggagagcg cggagctgga tgtacccatc cagagcatct tcaccaggga 780 
tctggccagc gttgccaggc tccgagttgc aaagggagat gtcaatgaca attttcaggg 84 0 
ggtgctgcag aatgtgaggt ttgtctttgg aaccacccca gaagacattc tcaggaacaa 900 
aggctgctcc agctctacca acgtccttct tacccttgac aacaacgtgg tgaacggttc 960 
cagccctgct atccgcacca actacatcgg ccacaaaaca aaggacctcc aagctatctg 1020 
tggcctctcc tgtgatgaac tatccagcat ggtcctggaa ctgaagggcc tgcgcaccat 1080 
cgtgaccact ctgcaggaca gcatccgaaa agtgacggaa gagaacagag agctggtcag 114 0 
tgagctgaag cggcctcccc tctgctttca caatggagtc cagtacaaga * acaacgagga 1200 
gtggactgta gacagttgca cagagtgtca ctgccagaac tcggttacca tctgcaaaaa 12 60 
ggtgtcctgt cccatcatgc cctgctccaa cgccacagtt cctgatggtg aatgctgccc 1320 
acggtgctgg cccagcgact ctgctgacga tggctggtct ccctggtctg agtggacctc 1380 
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ctgctctgcc acatgtggca atggaattca gcaacgtggt cgttcctgtg acagcctcaa 1440 

caacagatgc gagggctctt cggtacagac gaggacctgc cacattcagg agtgtgacaa 1500 

aagatttaaa caggatggtg gctggagtca ctggtctcca tggtcgtcct gttctgtgac 1560 

ctgtggtgac ggtgtgatca caaggatccg tctctgcaac tcccccagcc cccagatgaa 1620 

cgggaagccc tgtgaaggtg aagcccggga gaccaaagcc tgcaagaaag acgcctgccc 1680 

aattaatgga ggctggggtc cctggtcacc atgggacatc tgctctgtca cctgtggagg 1740 

aggagtgcag agacgcagcc gactctgtaa caaccccaca ccccagtttg gaggcaaaga 1800 

ctgtgttggc gatgtgacag aaaatcaagt ttgcaacaag caggactgcc caattgatgg 1860 

atgcctgtcc aatccctgct ttgctggtgc caagtgtact agctaccctg atggtagctg 1920 

gaaatgtggt gcgtgtcctc ctggctacag tggaaatggc atccagtgca aagacgtcga 1980 

tgagtgcaaa gaagtgcctg atgcttgctt caatcacaac ggagaacatc ggtgcaagaa 2040 

cacagatcct ggctacaact gcctgccctg cccaccacga ttcactggct cacagccctt 2100 

cggccgaggt gtcgaacatg ccatggccaa caaacaggtg tgcaaaccgc gaaacccctg 2160 

cacggacggg acgcatgact gcaacaagaa cgctaagtgc aactacctgg gtcactacag 2220 

cgaccccatg taccgctgtg agtgcaagcc cggctatgca ggcaatggca tcatctgcgg 2280 

agaggacaca gacctggacg gctggcctaa tgaaaacctg gtgtgtgtgg ccaacgcaac 234 0 

ctaccactgc aaaaaggaca actgccccaa ccttcccaac tcggggcagg aagactatga 24 00 

caaggacggg attggcgatg cctgcgatga tgacgatgac aacgacaaga tccctgatga 24 60 

cagggacaac tgtccattcc attacaaccc agcccagtat gactatgaca gagatgatgt 2520 

gggagaccgc tgtgacaact gcccctacaa ccacaaccct gaccaagcag acacagacaa 2580 

aaacggggag ggcgatgcct gtgctgtgga catcgatgga gatggaatcc tcaatgaacg 2640 

agacaactgc cagtacgttt acaacgtgga ccagagggac acggacatgg atggggttgg 2700 

agatcagtgt gacaactgcc ccctggaaca caatccagac cagctggact ctgactcaga 2760 

cctcataggg gacacttgtg acaacaatca ggacatcgat gaggatggcc atcagaacaa 2820 

cctggacaac tgtccctatg tgcctaacgc caaccaggcc gaccatgata aagatggcaa 2880 

aggagatgcc tgtgaccatg acgatgacaa tgacggcatc cctgatgaca gagacaactg 2940 
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caggctggtg cccaatcctg accagaagga ctctgatggt 
caaagacgac tttgaccatg acaatgtgcc agatattgat 
tgacatcagt gaaaccgatt tccgacgatt ccagatgatt 
ctcccaaaat gaccctaact gggttgtccg ccatcagggc 
aaactgtgac cctggaettg ctgtaggtta tgatgagttt 
taccttcttc atcaacaccg agagagatga tgactacgct 
gtccagcagc cgcttctacg ttgtgatgtg gaaacaagtc 
caaccccaca agggctcagg gatactcagg cctgtctgta 
cggccctggc gagcacctgc ggaatgcact gtggcacaca 
gcgcaccctg tggcatgacc ctcgccacat cggctggaaa 
gcgtctcagc cacaggccaa agaccggtta tatcagagtg 
aatcatggct gactcgggac ccatctatga caaaacctac 
gttcgtcttc tctcaggaaa tggtgttctt ctcagacatg 
ctaatcatca gctgccaatc ataaccagcg ctggcaatgc 
agagaaaccc cccacccctg ccgggatcgc ctttcctcgc 
catagtgtgg acttgtaaag cctgagacct gcctcaagaa 
agtcagcact cggcctttaa cgaatgagaa tgcatcttcc 
gggtttgctt ttgggaaagc caaagcgcct atttacttcc 
cactctgcct tactcacaga gccagaactt cttcgaggcc 
gaagcatttt caggcatgtc aaagaaagga aaaatgactc 
aacctctgac ataggtcctg agatgtgggg aggcaggagc 
gtacccaaga gatgactgta tgaagaaaat gtggaggagc 
tcaggggaca gacagacttg ctgcatttcc gcatgctgct 
atcttccaca caggcactt 

<210> 65 
<211> 186 



gatggccgag gtgacgcctg 3000 
gacatctgtc ctgagaattt 3060 
cctctagatc ccaaaggaac 3120 
aaagaactcg tccagactgt 3180 
aatgctgtgg acttcagcgg 3240 
ggcttggtat tcggctacca 3300 
acccagtcct actgggacac 3360 
aaggttgtga actccaccac 34 20 
ggaaacaccc ctggccaggt 34 80 
gatttcactg cgtacagatg 3540 
gtgatgtatg aaggaaagaa 3600 
gccggcggta gactaggcct 3660 
aaatacgagt gtcgagattc 3720 
accttctaaa aacaagggct 3780 
cttccttgcc tctcttcttg 3840 
aatgcagttt tcgaacccag 3900 
aagaccatga agagttcctt 3960 
cactaggaag gtgcccgctc 4020 
acctctgagc agcacacaca 4080 
actagaactc accgccaaac 4140 
caaagctcta gggagggcat 4200 
tgttcggtac taaatcattt 4260 
ggtgagagct gattgaccca 4320 

4339 
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<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 65 

gcacagttaa tggaggctgg ggtccctggt caccatggga catctgctct gtcacctgtg 60 
gaggaggagt gcagagacgc agccgactct gtaacaaccc cacaccccag tttggaggca 120 
aagactgtgt tggcgatgtg acagaaaatc aagtttgcaa caagcaggac tgcccaattg 180 
gtaagc 186 

<210> 66 
<211> 5774 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 66 

gtcactttgg ttgatagcag ccgctctggt agaggttagg acttcagctg atggacaagc 60 
tggtaatgaa gaaatggtgc aaatagattt accaataaag agatatagag agtatgagct 120 
ggtgactcca gtcagcacaa atctagaagg acgctatctc tcccatactc tttctgcgag 180 
tcacaaaaag aggtcagcga gggacgtgtc ttccaaccct gagcagttgt tctttaacat 240 
cacggcattt ggaaaagatt ttcatctgcg actaaagccc aacactcaac tagtagctcc 300 
tggggctgtt gtggagtggc atgagacatc tctggtgcct gggaatataa ccgatcccat 360 
taacaaccat caaccaggaa gtgctacgta tagaatccgg aaaacagagc ctttgcagac 420 
taactgtgct tatgttggtg acatcgtgga cattccagga acctctgttg ccatcagcaa 4 80 
ctgtgatggt ctggctggaa tgataaaaag tgataatgaa gagtatttca ttgaaccctt 540 
ggaaagaggt aaacagatgg aggaagaaaa aggaaggatt catgttgtct acaagagatc 600 
agctgtagaa caggctccca tagacatgtc caaagacttc cactacagag agtcggacct 660 
ggaaggcctt gatgatctag gtactgttta tggcaacatc caccagcagc tgaatgaaac 720 
aatgagacgc cgcagacacg cgggagaaaa cgattacaat atcgaggtac tgctgggagt 780 
ggatgactct gtggtccgtt tccatggcaa agagcacgtc caaaactacc tcctgaccct 8 40 



WO 99/37660 PCT/US99/01313 

-214- 

aatgaacatt gtgaatgaaa tttaccatga tgagtccctc ggagtgcata taaatgtggt 900 
cctggtgcgc atgataatgc tgggatatgc aaagtccatc agcctcatag aaaggggaaa 960 
cccatccaga agcttggaga atgtgtgtcg ctgggcgtcc caacagcaaa gatctgatct 102,0 
caaccactct gaacaccatg accatgcaat ttttttaacc aggcaagact ttggacctgc 1080 
tggaatgcaa ggatatgctc cagtcaccgg catgtgtcat ccagtgagaa gttgtaccct 1140 
gaatcatgag gatggttttt catctgcttt tgtagtagcc catgaaacgg gccatgtgtt 1200 
gggaatggag catgatggac aaggcaacag gtgtggtgat gagactgcta tgggaagtgt 1260 
catggctccc ttggtacaag cagcattcca tcgttaccac tggtcccgat gcagtggtca 1320 
agaactgaaa agatatatcc attcctatga ctgtctcctt gatgaccctt ttgatcatga 1380 
ttggcctaaa ctcccagaac ttcctggaat caattattct atggatgagc aatgtcgttt 1440 
tgattttggt gttggctata aaatgtgcac cgcgttccga acctttgacc catgtaaaca 1500 
gctgtggtgt agccatcctg ataatcccta cttttgtaag actaaaaagg gacctccact 1560 
tgatgggact gaatgtgctg ctggaaaatg gtgctataag ggtcattgca tgtggaagaa 1620 
tgctaatcag caaaaacaag atggcaattg ggggtcatgg actaaatttg gctcctgttc 1680 
tcggacatgt ggaactggtg ttcgtttcag aacacgccag tgcaataatc ccatgcccat 1740 
caatggtggt caggattgtc ctggtgttaa ttttgagtac cagctttgta acacagaaga 1800 
atgccaaaaa cactttgagg acttcagagc acagcagtgt cagcagcgaa actcccactt 1860 
tgaataccag aataccaaac accactggtt gccatatgaa catcctgacc ccaagaaaag 1920 
atgccacctt tactgtcagt ccaaggagac tggagatgtt gcttacatga aacaactggt 1980 
gcatgatgga acgcactgtt cttacaaaga tccatatagc atatgtgtgc gaggagagtg 2040 
tgtgaaagtg ggctgtgata aagaaattgg ttctaataag gttgaggata agtgtggtgt 2100 
ctgtggagga gataattccc actgccgaac cgtgaagggg acatttacca gaactcccag 2160 
gaagcttggg taccttaaga tgtttgatat accccctggg gctagacatg tgttaatcca 2220 
agaagacgag gcttctcctc atattcttgc tattaagaac caggctacag gccattatat 2280 
tttaaatggc aaaggggagg aagccaagtc gcggaccttc atagatcttg gtgtggagtg 2340 
ggattataac attgaagatg acattgaaag tcttcacacc gatggacctt tacatgatcc 2400 



WO 99/37660 PCT/US99/01313 

-215- 

tgttattgtt ttgattatac ctcaagaaaa tgatacccgc tctagcctga catataagta 24 60 
catcatccat gaagactctg tacctacaat caacagcaac aatgtcatcc aggaagaatt 2520 
agatactttt gagtgggctt tgaagagctg gtctcaggtt tccaaaccct gtggtggagg 2580 
tttccagtac actaaatatg gatgccgtag gaaaagtgat aataaaatgg tccatcgcag 264 0 
cttctgtgag gccaacaaaa agccgaaacc tattagacga atgtgcaata ttcaagagtg 2700 
tacacatcca ctctgggtag cagaagaatg ggaacactgc accaaaacct gtggaagttc 2760 
tggctatcag cttcgcactg tacgctgcct tcagccactc cttgatggca ccaaccgctc 2820 
tgtgcacagc aaatactgca tgggtgaccg tcccgagagc cgccggccct gtaacagagt 2880 
gccctgccct gcacagtgga aaacaggacc ctggagtgag tgttcagtga cctgcggtga 294 0 
aggaacggag gtgaggcagg tcctctgcag ggctggggac cactgtgatg gtgaaaagcc 3000 
tgagtcggtc agagcctgtc aactgcctcc ttgtaatgat gaaccatgtt tgggagacaa 3060 
gtccatattc tgtcaaatgg aagtgttggc acgatactgc tccataccag gttataacaa 3120 
gttatgtztgt gagtcctgca gcaagcgcag tagcaccctg ccaccaccat accttctaga 3180 
agctgctgaa actcatgatg atgtcatctc taaccctagt gacctcccta gatctctagt 324 0 
gatgcctaca tctttggttc cttatcattc agagacccct gcaaagaaga tgtctttgag 3300 
tagcatctct tcagtgggag gtccaaatgc atatgctgct ttcaggccaa acagtaaacc 3360 
tgatggtgct aatttacgcc agaggagtgc tcagcaagca ggaagtaaga ctgtgagact 3420 
ggtcaccgta ccatcctccc cacccaccaa gagggtccac ctcagttcag cttcacaaat 3480 
ggctgctgct tccttctttg cagccagtga ttcaataggt gcttcttctc aggcaagaac 3540 
ctcaaagaaa gatggaaaga tcattgacaa cagacgtccg acaagatcat ccaccttaga 3600 
aagatgagaa agtgaaccaa aaaggctaga* aaccagagga aaacctggac aacctctctc 3660 
ttcccatggt gcatatgctt gtttaaagtg gaaatctcta tagatcgtca gctcatttta 3720 
tctgtaattg gaagaacaga aagtgctggc tcactttcta gttgctttca tcctcctttt 3780 
gttctgcatt gactcattta ccagaattca ttggaagaaa tcaccaaaga ttattacaaa 384 0 
agaaaaatat gttgctaaga ttgtgttggt cgctctctga agcagaaaag ggactggaac 3900 
caattgtgca tatcagctga ctttttgttt gttttagaaa agttacagta aaaattaaaa 3960 
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agagatacca atggtttaca ctttaacaag aaattttgga tatggaacaa agaattctta 4020 
gacttgtatt cctatttatc tatattagaa atattgtatg agcaaatttg cagctgttgt 4080 
gtaaatactg tatattgcaa aaatcagtat tattttaaga gatgtgttct caaatgattg 4140 
tttactatat tacatttctg gatgttctag gtgcctgtcg ttgagtattg ccttgtttga 4200 
cattctatag gttaattttc aaagcagagt attacaaaag agaagttaga attacagcta 4260 
ctgacaatat aaagggtttt gttgaatcaa caatgtgata cgtaaattat agaaaaagaa 4320 
aagaaacaca aaagctatag atatacagat atcagcttac ctattgcctt ctatacttat 4380 
aatttaaagg attggtgtct tagtacactt gtggtcacag ggatcaacga atagtaaata 4440 
atgaactcgt gcaagacaaa actgaaaccc tctttccagg acctcagtag gcaccgttga 4 500 
ggtgtccttt gtttttgtgt gtgtgtgttc ttttttaatt ttcgcattgt tgacagatac 4560 
aaacagutat actcaatgta ctgtaataat cgcaaaggaa aaagttttgg gataacttat 4 620 
ttgtatgttg gtagctgaga aaaatatcat cagtctagaa ttgatatttg agtatagtag 4 680 
agctttgggg ctttgaaggc aggttcaaga aagcatatgt cgatggttga gatatttatt 4740 
ttccatatgg ttcatgttca aatgttcaca accacaatgc atctgactgc aataatgtgc 4800 
taataattta tgtcagtagt caccttgctc acagcaaagc cagaaatgct ctctccaggg 4860 
agtagatgta aagtacttgt acatagaatt cagaactgaa gatatttatt aaaagttgat 4 920 
ttttttttct tgatagtatt tttatgtact aaatatttac actaatatca attacatatt 4980 
ttggtaaact agagagacat aattagagat gcatgctttg ttctgtgcat agagaccttt 5040 
aagcaaacta ctacagccaa ctcaaaagct aaaactgaac aaatttgatg ttatgcaaac 5100 
atcttgcatt tttagtagtt gatattaagt tgatgacttg tttcccttca aggaaacatt 5160 
aaattgtatg gactcagcta gctgttcaat gaaattgtga attagaaaca tttttaaaag 5220 
tttttgaaag agataagtgc atcatgaatt acatgtacat gagaggagat agtgatatca 5280 
gcataatgat tttgaggtca gtacctgagc tgtctaaaaa tatattatac aaactaaaat 5340 
gtagatgaat taacctctca aagcacagaa tgtgcaagaa cttttgcatt ttaatcgttg 5400 
taaactaaca gcttaaacta ttgactctat acctctaaag aattgctgct actttgtgca 5460 
agaactttga aggtcaaatt aggcaaattc cagatagtaa aacaatccct aagccttaag 5520 
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tctttttttt ttcctaaaaa ttcccataga ataaaattct ctctagttta cttgtgtgtg 5580 

catacatctc atccacaggg gaagataaag atggtcacac aaacagtttc cataaagatg 5640 

tacatattca ttatacttct gacctttggg ctttcttttc tactaagcta aaaattcctt 5700 

tttatcaaag tgtacactac tgatgctgtt tgttgtactg agagcacgta ccaataaaaa 57 60 
tgttaacaaa atat 5774 

<210> 67 

<211> 5535 

<212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism : Unknown 
<400> 67 

ggactttaga agccgttgct gccctctctg tcacctgaag cggggccctc tcccatccca 60 
cccttgcccc gcctccctgc ccccaccggg ccggccctgc ccgccgccgg accctggcat 120 
gtcaagacct ggtccgcgcc tgcctgccca gcccgcggaa ccccggcggc cccgcgagct 180 
aggatgaggg gccaggccgc cgccccgggc cccgtctgga tcctcgcccc gctgctactg 240 
ctgctgctgc tgctgggacg ccgcgcgcgg gcggccgccg gagcagacgc ggggcccggg 300 
cccgagccgt gcgccacgct ggtgcaggga aagttcttcg gctacttctc cgcggccgcc 360 
gtgttcccgg ccaacgcctc gcgctgctcc tggacgctac gcaacccgga cccgcggcgc 420 
tacactctct acatgaaggt ggccaaggcg cccgtgccct gcagcggccc cggccgcgtg 480 
cgcacctacc agttcgactc cttcctcgag tccacgcgca cctacctggg cgtggagagc 54 0 
ttcgacgagg tgctgcggct ctgcgacccc tccgcacccc tggccttcct gcaggccagc 600 
aagcagttcc tgcagatgcg gcgccagcag ccgccccagc acgacgggct ccggccccgg 660 
gccgggccgc cgggccccac cgacgacttc tccgtggagt acctggtggt ggggaaccgc 720 
aaccccagcc gtgccgcctg ccagatgctg tgccgctggc tggacgcgtg* tctggccggt 780 
agtcgcagct cgcacccctg cgggatcatg cagaccccct gcgcctgcct gggcggcgag 84 0 
gcgggcggcc ctgccgcggg acccctggcc ccccgcgggg atgtctgctt gagagatgcg 900. 
gtggctggtg gccctgaaaa ctgcctcacc agcctgaccc aggaccgggg cgggcacggc 960 
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gccacaggcg gctggaagct gtggtccctg tggggcgaat gcacgcggga ctgcggggga 1020 
ggcctccaga cgcggacgcg cacctgcctg cccgcgccgg gcgtggaggg cggcggctgc 1080 
gagggggtgc tggaggaggg tcgccagtgc aaccgcgagg cctgcggccc cgctgggcgc 114 0 
accagctccc ggagccagtc cctgcggtcc acagatgccc ggcggcgcga ggagctgggg 1200 
gacgagctgc agcagtttgg gttcccagcc ccccagaccg gtgacccagc agccgaggag 12 60 
tggtccccgt ggagcgtgtg ctccagcacc tgcggcgagg gctggcagac ccgcacgcgc 1320 
ttctgcgtgt cctcctccta cagcacgcag tgcagcggac ccctgcgcga gcagcggctg 1380 
tgcaacaact ctgccgtgtg cccagtgcat ggtgcctggg atgagtggtc gccctggagc 14 4 0 
ctctgctcca gcacctgtgg ccgtggcttt cgggatcgca cgcgcacctg caggcccccc 1500 
cagtttgggg gcaacccctg tgagggccct gagaagcaaa ccaagttctg caacattgcc 1560 
ctgtgccctg gccgggcagt ggatggaaac tggaatgagt ggtcgagctg gagcgcctgc 1620 
tccgccagct gctcccaggg ccgacagcag cgcacgcgtg aatgcaacgg gccttcctac 1680 
gggggtgcgg agtgccaggg ccactgggtg gagacccgag actgcttcct gcagcagtgc 1740 
ccagtggatg gcaagtggca ggcctgggcg tcatggggca gttgcagcgt cacgtgtggg 1800 
gctggcagcc agcgacggga gcgtgtctgc tctgggccct tcttcggggg agcagcctgc 1860 
cagggccccc aggatgagta ccggcagtgc ggcacccagc ggtgtcccga gccccatgag 1920 
atctgtgatg aggacaactt tggtgctgtg atctggaagg agaccccagc gggagaggtg 1980 
gctgctgtcc ggtgtccccg caacgccaca ggactcatcc tgcgacggtg tgagctggac 2040 
gaggaaggca tcgcctactg ggagcccccc acctacatcc gctgtgtttc cattgactac 2100 
agaaacatcc agatgatgac ccgggagcac ctggccaagg ctcagcgagg gctgcctggg 2160 
gagggggtct cggaggtcat ccagacactg gtggagatct ctcaggacgg gaccagctac 2220 
agtggggacc tgctgtccac catcgatgtc ctgaggaaca tgacagagat tttccggaga 2280 
gcgtactaca gccccacccc tggggacgta cagaactttg tccagatcct tagcaacctg 2340 
ttggcagagg agaatcggga caagtgggag gaggcccagc tggcgggccc caacgccaag 24 00 
gagctgttcc ggctggtgga ggactttgtg gacgtcatcg gcttccgcat gaaggacctg 24 60 
agggatgcat accaggtgac agacaacctg gttctcagca tccataagct cccagccagc 2520 
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ggagccactg 
gtgccagagg 
gatgaagcat 
gccctgcaga 
ccgcctcgct 
accaaccaga 
cagctcgggc 
cgctgcctct 
aacatggaga 
ctcaccctgc 
cgttctgtca 
atcgggcaga 
ttcttcttcc 
gtgacgggcc 
ctccctgcac 
atgaactact 
gctgccgttg 
aaagacggca 
tgcgtggtgc 
cgccgct ccg 
atcgtcatgg 
gttgaccggc 
cagctcatga 
gacatcgcgg 
gaggaggaga 
ccggccaacg 



acatcagctt 
acagggtcac 
ccgtgtttgt 
ggaacacgac 
ccctgcgcac 
cctgtatcct 
cctggtcgtg 
gtgaccggct 
aggcgactct 
tcatgctggt 
tcctcatcaa 
cccagacccg 
tgtcctcctt 
acctccggaa 
tggttgtggc 
gctggctctc 
tgctggtgaa 
tcacggacaa 
tgccgctgct 
ccctcttcca 
tgcactgtat 
aggaggaggg 
ccgacttcga 
cctgccgcac 
agctgaagct 
tgt ccaagct 



ccccatgaag 
tgtgtccaag 
ggtgggcacc 
cgtcctgaat 
acccttggag 
gtgggatgag 
gcgcggctgc 
ctccaccttc 
gccgtcggtg 
catcatctac 
cttctgcctg 
caacaaggtg 
ctgctgggtg 
ccgcctcatc 
catttctgtg 
cctggagggg 
catggtcatt 
gaagctgaag 
ggcgctgacc 
gatcctcttc 
cctccgtaga 
caacggggac 
gaaggacgtg 
tgccaccatc 
ggcccatgcc 
gcacctgcac 
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ggctggcggg 
agtgtcttct 
gtgctctaca 
tctaaggtga 
atcgagtttg 
acggatgtac 
cgcacggtgc 
gccatcttag 
acgctcatcg 
gtgtccgtgt 
tccatcatct 
atgtgcacgc 
ctcaccgagg 
cgcaagcgct 
ggattcacca 
ggactgctct 
gggatcctgg 
gagcgggcag 
tggatgtcgg 
gctgtcttcg 
gaggtccagg 
tcagggggct 
gatctggcct 
acgggcacac 
aaggggccgc 
ggctcacccc 



ccacgggtga 
ccacggggct 
ggaacctggg 
tctccgtgac 
cccacatgta 
cctcctcctc 
ccctcgacgc 
cccagctcag 
tgggctgtgg 
ggaggtacat 
cctccaatgc 
tggtggccgc 
cctggcagtc 
tcctctgcct 
aggccaaagg 
atgccttcgt 
tgttcaacaa 
gggcctccct 
ctgtgctcgc 
actcgctgga 
acgctgtgaa 
ccttccagaa 
gtagatcagt 
tgaagcggcc 
ccaccaattt 
gctatcccgg 



ctgggccaag 
gacagaggcc 
cagcttcctg 
tgtgaaaccc 
taatggcacc 
cgcccccccg 
cctccggacg 
cgccgacgcg 
cgtgtcctct 
tcgctcagag 
cctcatcctc 
cttcctgcac 
ctacatggcc 
gggctggggg 
gtacagcacc 
gggacctgcc 
gctcgtgtcc 
gtggagctcc 
cgtcaccgac 
gggcttcgtc 
atgccgtgtg 
cggccacgcc 
gctgaacaag 
gtctctgccc 
caacagcctg 
cgggcccctg 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
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cccgacttcc ccaaccactc actgaccctc aagagggaca aggcgcccaa gtcctccttc 4140 
gtcggtgacg gggacatctt caagaagctg gactcggagc tgagccgggc ccaggagaag 4 200 
gctctggaca cgagctacgt gatcctgccc acggccacgg ccacgctgcg gcccaagccc 4 260 
aaggaggagc ccaagtacag catccacatt gaccagatgc cgcagacccg cctcatccac 4 320 
ctcagcacgg cccccgaggc cagcctcccc gcccgcagcc cgccctcccg ccagcccccc 4 380 
agcggcgggc cccccgaggc accccctgcc cagcccccac cgcctccgcc cccaccgcca 4 4 40 
ccacctcccc agcagcccct gcccccaccg cccaatctgg agccggcacc ccccagcctg 4 500 
ggggatcccg gggagcctgc cgcccatccg ggacccagca cggggcccag caccaagaac 4 560 
gagaatgtcg ccaccttgtc tgtgagctcc ctggagcggc ggaagtcgcg gtatgcagaa 4 620 
ctggactttg agaagatcat gcacacccgg aagcggcacc aagacatgtt ccaggacctg 4 680 
aaccggaagc tgcagcacgc agcggagaag gacaaggagg tgctggggcc ggacagcaag 4 74 0 
ccggaaaagc agcagacgcc caacaagagg ccctgggaga gcctccggaa agcccacggg 4 800 
acgcccacgt gggtgaagaa ggagctggag c'cgctgcagc cgtcgccgct ggagcttcgc 4 8 60 
agcgtggagt gggagaggtc gggcgccacg atcccgctgg tgggccagga catcatcgac 4 920 
ctccagaccg aggtctgagc gggtgggcgg cggccacgca ctgggccacg gaggagggat 4 980 
gctgctccgc ccgctcctgc cgcagacggg cacagacacg ctcgcgggca gcgggccagg 5040 
cccgcacccc ggcctcaggg cgctcagacg gcggccaggc acagggcccg cagtgctggg 5100 
accagagcca gatgcaggac aggaggcggc ccggccagcg ggcacagggc accagaggcc 5160 
gaaggtgcct cagactccgc cctcctcggg ccgaggccca gcgggcagat gggcggacgg 5220 
ctgtggaccg tggacaggcc cagcgcggcc agcgtcccag ggtacccgcc tgagctcctg 5280 
ctgcggagga gctgcctgct tggcccggcc ggcctggcac cgttttttaa acacccccat 534 0 
ccctcgggaa gcagccagct ccccacacct tccagggccc taggcccctc ctagacccag 54 00 
gtggagggca cagccctccg accctcatgg cccccagggg caggactgag tcccctccag 54 60 
gaagaagcag gggggaatct attttttctc tccttttctt ttcttcaata aaaagaatta 5520 
aaaacccaaa aaaaa 5535 
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<210> 68 
<211> 398 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 68 

cggggcaacc cgctggagtg gacgggccag gtgacggtgc gcaagaagcg caagccctac 60 

tccaagttcc agacgctcga gctcgagaag gagttcctct tcaacgcgta cgtcagcaag 120 

cagaagcgct gggagctggc gcgcaacctc aacctcaccg agcgccaggt caagatctgg 180 

ttccagaacc ggcgcatgaa gaacaagaag aacagccagc gccaggcggc cagcagcagc 24 0 

agcagcaaca gcagcagcag cagcagcagc aacagcagca agcggccgcc ggcggggcgt 300 

cggccgccgc caacggccac cagggccacc aagcgcacca ccacgcgccc cccaacggcg 360 

ccgtcgcagc cctcaagcac caccagtgac ccgtagcg 398 

<210> 69 
<211> 8670 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 69 

cccgggtgcg gtgtcgtgtg tggggctggg cgccatgttc ctggacatgc tgagggccaa 60 

gcgcgacacg gcgcccgacc gccgccagct ggacgaccgg atgatggggg cggacccggg 120 

ggacatagcg gccaaggtga gggcagggtt ttgcgtgcgt gcttgattgt gcgtgtgcgt 180 

gcgtgcgtgc gtgcgtgcgg tgttgcgtgt gtatttgaac tgtgttttgt gtatgtactt 24 0 

aggggtaaga gtgcatacac atgcatgcga ccggtggcct tacaaatcaa caacacgtac 300 

gcctgcatgt atccaggtgg cagcgtggcg acgagcacgt ggcttcgagg gcccaggcac 360 

ggcgggcccc agcggcagcg ccgccagtgg cagcggcgcc agcggctcgg caccgcaggc 420 

gcgctcgccc cgacctcagc caccgcggcc gcgctcacct tcacgcgggt gaaccccggc 4 80 

gaggagccgc ccgtgtacgc gtgcgagcaa acaggtgcgt aagcgacgtg tgggcagcgc 540 

gaagaggcgt gggggcgaga gagcaaaggg actagggaaa cgcacagcca aatacggtat 600 
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gcgggcaacg aggcgatggc cctggaaatc 
aattgctggg cgactaccgt agtctactga 
actgcagtag tgccgttgcc agccgcgctg 
ggcccatgcc tcgcgcactc cgagcaccca 
ccgccgccct ctcccccgcc ttcacttctt 
ccgcttcctc aggctgggtg cacgtgtgcg 
ccgcagcgaa ctgctggtgt gcccggtgag 
agcgcgtgtt tctcagggcc tgtgtgggac 
gaactgtgta tgtggagtag cgtggcgtgt 
tgccccacgt tgcagagtca gctgacaacg 
gcacgcacgt cctccgcact cccaacaaat 
ctatgcccgc acacgctccc gcgccctcag 
cacgagtgcg aggaggatga ggcgggcggc 
ggaggacggc ctgggactgg gcatcacagg 
gggccttgta cgtgactgcg gttctgcatg 
gtacggcttg tgggcatgca gtgccttgac 
ttgccccgct ccctgcacac actgcagccg 
agtgcgagaa cgcgcagcag ctcaacaggc 
ccgcaggggg agtgtgttca tattgtggtt 
gtattgcacg cgtacagcat tggtgactgg 
aggcgctgca gcgagttgtg gggcatgcgt* 
gttgagttgg cgtttggagg tgtgggacga 
agaaggcgtg accgttttac gagctcgtat 
atttttgtta caagcgacga atgcgtgggg 
ttgcgcatgc acacgtgacc aaatttatgc 
ttgtgtatcg gttggcactt ataacaattg 



gcagggccct tttgaaatcg tgtaaggcgc 660 
tgcattgcac tacttgtatt actgtatcct 720 
ctgccctttg gctcccttcc caatccaaat 780 
gagcacccag aagccgttgc gtgcgctccg 840 
aattaatcgt gaatgtaatc cccccccccc 900 
cgacgcctgc acggagggtg tggtggatgc 960 
tcgacgagga ggaggtgcaa gggggatacc 1020 
accgaaacgt ggtaaaagag acccgcccgc 1080 
gcggccggac cgacaaggca gcttgtggac 114 0 
acacgtgcgc cttcctgtca ttgcccgtgc 1200 
tgacagcgac acgtgcgcct tcctataagc 1260 
gtgtcgggcc agaccacaga ccggttggtc 1320 
tgcggcggcg ccggcggggc gccgcggcga 138 0 
tgggtggcag gctggcaggg actcacgcat 14 4 0 
gctagtggct cacgcgctgc gcacgttcac 1500 
gtgaggctgc gctgccttgc tgctgccgcc 1560 
gcttcgggcg ctacttcacc gcgggctacg 1620 
tgctggggta caaggcgctg tgagagcgcg 1680 
gtttgggccg tgggcgcggg ctgcatgtgc 174 0 
tcaggtgtaa gcggccggca gtgcgccgcg 1800 
catgcgcaga cggcccctgg acgacaaggc 1860 
cgtggggttt gtgccgtcaa agcacagaac 1920 
gatgtagcat ggattgaata atgacatgt'g 1980 
ttttggatgg caggggtttc agtcgcccga 204 0 
tcaacgacgt gaccattgct ttatacatac 2100 
gctcgtcaaa ttgacgcgag gctgcacttc 2160 
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gatcctgaas 


gccccagttc 


aacaagtcgg 


atagccaaat 


ggccccgctc 


gctct ccagc 


2220 


at caaggggc 


ct ct aagtgc 


ctcgcggcaa 


cccagcgcaa 


gtgtgctcgc 


gttgcggtga 


2280 




tgcacttgt c 


gacgccgtcg 


gcaccgcaat 


cgaaagacgc 


gtgcgtcgag 


2340 


*w a C3. ^ l_ U v» *J y 


agccgctgac 


gaattgtccg 


catgtgacat 


tgcaggctcg 


cgtccccgct 


2400 


catctcaaca 


t catggccca 


qqtqcggacg 


ttgggactgc 


act tgcacga 


atgtgatggg 


2460 


gccgcsccgs 


otctqcQcqg 


acgtctcgct 


gacgtttcgc 


gttgaatgca 


tctcgcaata 


2520 


ggcagctgct 


crcacctqctq 


acaacactaa 


gaagctgtgg 


ggcggtcgct 


tcacgggcaa 


2580 


aaccrcracccci 

U t-i v*- 


ct ca t ggaga 


agttcaacga 


gtcgctgccc 


tttgacaagc 


gcctgtgggc 


2640 
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gqgcgaqtgg 

~ -3 J ZJ ZJ ZJ ZJ 




ctggggggga 


2700 




ataaccaaaq 


caqqcjaggga 


agacgtgcgg 


ggctaggcaa 


gaggctgcga 


2760 




taacaccaga 


ccqtgccgtq 


tcgcgtgccc 


ggcttgctgc 


ccaccttgcc 


2820 


cggccstccc 


caccgccctc 


cccaccagca 


atgacacgta 


cacattcaca 


cactccccca 


2880 


cacccacata 


cccacacacc 


cacgcattcc 


ccaacagggc 


agccaggcgt 


acgccaaggc 


2940 


tcttgccaag 


gccggcatt c 


tggcacatga 


cgaggccgtg 


accattgtgg 


aggggctggc 


3000 


l« ao <j ^ l. y v^. y \— 


acacccanca 


qcaqcfQCQQQ 


tqqqtqqgtg 


ggtggggtgg 


gggggcagag 


3060 




nrtaaoaonQ 

u I* a *H A %^ 


qactqaqaqq 


qqqqtcaqcg 


aggcgcaggc 


tcagggggag 


3120 


gcgt ctgagg 


oaq qctqaqa 
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tgqtggtggg 
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ggagctgcgg 


gtgctggggc 


tgctgcggtg 


3180 




qqqcggqcqq 


gcgacgtgta 


cgtgagtagc 


cgctgaccgg 


gcgctgggcc 


3240 


t tt gcgcacg 


ccacagccca 


catgacaccg 


ccgcaaggcc 


cgccgcgccc 


cacccacgtt 


3300 


cacacactcc 


ccacacccac 


gcgtgcgcgc 


gcctccttcc 


cctcaataca 


cgcgcctcct 


3360 


tcccctggcc 


cccgcctgct 


ccccccatcc 


ggccgccccg 


cctgcaggtg 


gctgaggagt 


3420 


ggaaggcggg 


tgcctttgtg 


atcaaggcgg 


gtgacgagga 


cat ccacacg 


gccaacgagc 


"5 a p n 


ggcgcctcac 


ggagctggtg 


ggggcggtgg 


gcggcaagct 


gcacaccggc 


cgctcgcgca 


3540 


acgaccaggt 


gagggtgggt 


gggtgggggt 


ggggtgggtg 


ggtgggtggg 


tgggtgggtg 


3600 


ggtgggtggg 


tgggtgggtg 


ggtgggtggg 


ggtttgagat 


accggtacca 


ggccaaacta 


3660 


aaccgaaccc 


aagggggtgg 


cgtaggggcg 


tgggaggggg 


ggagtgcgga 


agccgggagg 


3720 
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caggagtaag ggcgggagga gggggccgga ggagaagcag ggacgaagtc gatgacaggc 3780 
gcagtcggtg gcggcggtgg cgggtgtgcc gttgtgcagt ggctgtggag gccatgtgca 3840 
gggcggcggc ggggccgggc cgggggtggg agacttgtcc agaccccgtg gccctcttcc 3900 
agccccgtcc gccactgccg ccaccaccac cgccgccgcc gtagccacca cccctcacgt 3960 
cgaggcactt cacagatgcg aagcaaccac accgttctcc acatgaacag ctaccctccc 4 020 
aaacccaact ttcccttccc gccttaccta accatgaccc gctacccccc ccccctttat 4 080 
ttcttaacta accatgaatg cccccccccg gctgtacctg gctacgactt cacttcgtaa 4140 
acttaatgtg tgtaaccccc cttacacaca cacacacacc cctccccgcc cctccaaagg 4200 
ttgccaccga ctaccggctg tggctggtgg gtcaggtgga ggtgatgcgg tccgaggtgg 4260 
gcgagctgat gcgcgtggcg gcggaccgct ccgaggcaga ggtggaggtg ctcatgccgg 4 320 
gtgagggggc agggaggggg ggagggggag ggggaggtgc tcatgccggt gagggtaggg 4 380 
aggggagggg cagaggaggg agggggagga gggggcggct gagtgcggga gaggcaggga 44 40 
tgagggcgat agaaagttgc gtattgtcgg taaactcaaa ggactagacg aagagaacaa 4500 
acctaaacaa gggagctgga gcgaggccaa atctgaacgt gacatcgccc gcctcctccc 4 560 
gctgcctgct cccccacctc ctcccccatc tcgccccccc ccccacacac acacaggctt 4 620 
cacgcacctt cagaatgcca tgactgtgcg ctggagccac tggctgatga gccacgccgc 4 680 
ggcctggcag cgcgacgaca tgcggctgcg ggacctgctg ccgcgggtgg ccacactgcc 4740 
gctgggctcg ggtgggtgag ggaggggagg ggaggggagg gggggagggg gagggagagg 4 800 
aggggagaag ggggggggag acgaggaggg tggaagggtg ggggcggggc ggtggaggct 4 8 60 
agagggtggg gctgggtggg tggacggagt gcactggtag aggagggata gggtacattg 4 920 
agacgggagg agggatgcag gggcgaaggt ggggaggagg ggaggggagg aggcgtggag 4 980 
ctggagtggg ccgacgagtg tgcggacggg gcaggcggca acggggatta aacggcgggg 504 0 
ggccggggcg tgtgcacgac aggggcttgc gcgtctgcga ttgtgggggc? acacagggac 5100 
aggagcacga cgtgggacac gcatagatac gccgcattga caacacacac acacacacac 5160 
acacacacac acacacacac acacacacaa acacaaacac acacaaacac aaacacacac 5220 
acgccccccc ccctacacac acgccccctc cccaggcgcc ctggccggca acccctttct 5280 
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ggtggaccgc cagttcatcg ccaaggagtt gggtttcggc ggcggcgtgt gccccaactc 534 0 
catggacgcg gtgaggggag gaggaggggg aggagggcgg gggggggcag gaggggggag 54 00 
gaggaggggg ggagggggtt aactttgaag cgtaaggaaa cagtcgggag gaggggggga 54 60 
aggagggggc ctggaggagg gggggaggag gagggtggct ggagggggct gggggaggag 5520 
gagggggagg attgggaggg ggctggggga gggtgcccgc agctggggga ggtggggagg 5580 
gagggggttg ctgctggtgt aaagggcctg taggcactga gagcactgtg gggagccggg 564 0 
gtactgcctg gggccccgcg ctgcagaggt gtcgcgcagt gtggcggcgc atcccccgca 5700 
tccccacacg cgggccgctg ccgctgcccg ccacaccctt gccactttgt gtgctttcct 57 60 
aggatataca cacacacaca cacacacaca cacacacaca cacacacaaa cacaaacaca 5820 
cacgggcgcg ggctttcgtt tcgtt tttta acacaaacac acactccccc tgtgctcctc 5880 
aacacactcc atctttctca cacaaacaca cacgcacaca cacatgcgca ggtgtctgac 594 0 
cgcgactttg tgatcgagac ggtgtttgcg gccagcctgc tgtgcgtgca cctgtcgcgc 6000 
tgggcggagg acctcatcat ctacagctcc ggccccttcg gctacgtgca gtgcagcgac 6060 
gcctacgcca ccggctcctc gctcatgccg cagaagaaga accccgacgc cctggagctc 6120 
atcaggtgcg ggagggatgg ggtgggggtg ggggggttac attcatggtt agttaagaag 6180 
tgaaggcgta gggggtggat ggggtgggtt acattcatga acatttaaga agtgaaggcg 6240 
tagccaggaa cagtagtaga gcagacgcgt tgtagtgtgt gggtttgggt gggagggatg 6300 
gttgggtaaa gcggtacagg atgtactgag gactgcagac cgaaggagcg ggggaggggg 6360 
agcaggcagg cggggcgagg ggcgtggggg cgggggttac tggcaccgtg ccgggtaagc 6420 
aacacgtgac acggagatgc accacacaaa gagggacgtg gggagtggca ggcgggggcc 64 80 
agggctgaga ggcgcgtgtg gaggggtgcg gggttgggcg gggggctgtt tcatgatacc 654 0 
gctgcctcca cctcctccac cgcctcctgc cacctccacc tcccccactg cccctccccg 6600 
cctcctcctg ctgcaggggc aagggcggtc gtgtgcaggg caacctgatg ggcgtcatgg 6660 
cggtgctcaa gggcacgccc accacataca acaaggactt ccaggcgaga gagcgagagc 6720 
gagggaggga gggagagcga gggagaggga gggagaggga gggagaggga gacagaggga 6780 
cagggacagg gacagggaca gggacaggga cagggacagg ggcaggggca ggggcagggg 6840 
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caggggcagg ggcaggggag gccccccggg ggcggcgggc ccggggcatg aggtcagaca 6900 
taggggcgct gcactgaggc cgcgaggcgg gcgggaggca gggggcgggg ggcggggggc 6960 
gggagcggac atgcgccgca aacacagacg ggttgagaaa gcacaacgac tggaacgcag 7020 
tgggcttact gacaattcat cattgtgcgc atatgtgtgt atgtgtatgt gtgtgtttgt 7080 
ttgtgcagga gtgttgggag ctgctgtttg acacggtgga cacggtgcac gacgtggtgc 7140 
gcatcgccac cggcgtgctg tccaccctgc ggatcaagcc cgaccgcatg aaggccggtg 7200 
agcgtagccg agcagggctg gagcagcagc cgggcagcag tagcagcagg gcaggggagc 7260 
agcgggagcg ggagcagcag gaggggtggt tgggaagcgg tgggggtagg gtgggagcgg 7320 
aggaagggaa ggaggagcag gagcaggagg aagaggagga ggaagggcgg tggggggtgg 7380 
ggggtcgtgt ccttggccgc atgggcggag gcggggaggc ggggaggagg cggggaagca 74 4 0 
gagcctgcac ccacgctccg cgggtcccta ccgtcttgcg cctaaccccg tgcgcctagc 7500 
ctcttgcgcc caccccctta gtgcatcctg tacccctctt tccaaacatc cttgcaactc 7560 
cctgacctcc tcgccaaacc tcccccgccc ccaggcctgt ccgccgacat gctcgccacg 7 620 
gacttggccg agtacctggt gcgcaagggc gtgccgttcc gggagacaca ccaccacagg 7 680 
tgcggccggg cgggagggcg tgagggcgtg ggtggggcat gcccggggtt gtgagagcta 774 0 
tcgaacgttg tgccgcgcct gtttcacaat gtcgggccac agggtatgca gtttcctctc 7800 
catatgtata acaaactgac caccaatcat gcacgctcac acgctctccc acacacacgc 7 8 60 
gcaccacgcc accacagcgg cgccgccgtg aagatggccg aggaccgcgg ctgcacgctg 7 920 
ttcgacctca ccgtggacga cctcaagacc atccacccgc tcttcaccga cgacgtggcg 7980 
gcggtgagcg gcggcgcgga gcagcagcag cagcagcagc agcagcagca gcagcagcag 8040 
tagcctgggg gggagcgtgt gggaggaacg gcgggggagg ggaggcgggg ggtgtcgttt 8100 
gcagccgagc gcacgtggtg ctttgcccca ttccatgcca gcagggtgac acacctgacc 8160 
atgctggtgt gctgctaggt ggttcacacc tacgtgtgaa tttgtgctgg cgtgcgcaca 8220 
ccttactgtg gccatgtgaa cggcatcctc atgtcctcgt gattgcgccc ggcacattgc 8280 
ccacaacccc gcaccaccca gctcctcaat ccagtgcaag gaaaggaaat gcacgcccgc 834 0 
cgcaccaaca acacgacgca tgtgtttgcc acgtgcgcgc acacacgcgc aggtgtggga 84 00 
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cttcaaccgc agcgccgaga tgcgcgacac ggagggcggc accagcaagc gctcggtgct 84 60 

ggagcaggtg cagaagatgc gcacctacct ggcggcggag ggacagcact gagcgggtcg 8520 

ggggaggggg ggcgggtgtg tatgtgtgtg tgtgtgcgtg tgtaagtctc ggtggagggg 8580 

tggtcctcta tatggcggcg gggccacagg gggacgggtg tgacagagtt acggccggag 8640 

ccagcggagt cccgggatgg attaaggatc 8670 

<210> 70 
<211> 745 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism : Unknown 
<400> 70 

atgagatggc gacgcgcccc gcgccgctcc gggcgtcccg gcccccgggc ccagcgcccc 60 
ggctccgccg cccgctcgtc gccgccgctg ccgctgctgc cactactgct gctgctgggg 120 
accgcggccc tggcgccggg ggcggcggcc- ggcaacgagg cggctcccgc gggggcctcg 18 0 
gtgtgctact cgtccccgcc cagcgtggga tcggtgcagg agctagctca gcgcgccgcg 240 
gtggtgatcg agggaaaggt gcacccgcag cggcggcagc agggggcact cgacaggaag 300 
gcggcggcgg cggcgggcga ggcaggggcg tggggcggcg atcgcgagcc gccagccgcg 360 
ggcccacggg cgctggggcc gcccgccgag gagccgctgc tcgccgccaa cgggaccgtg 4 20 
ccctcttggc ccaccgcccc ggtgcccagc gccggcgagc ccggggagga ggcgccctat 4 80 
ctggtgaagg tgcaccaggt gtgggcggtg aaagccgggg gcttgaagaa ggactcgctg 540 
ctcaccgtgc gcctggggac ctggggccac cccgccttcc cctcctgcgg gaggctcaag 600 
gaggacagca ggtacatctt cttcatggag cccgacgcca acagcaccag ccgcgcgccg 660 
gccgccttcc gagcctcttt cccccctctg gagacgggcc ggaacctcaa gaaggaggtc 720 
agccgggtgc tgtgcaagcg gtgcg 74 5 



<210> 71 
<211> 1986 
<212> DNA 
<213> Unknown 
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<220> 

<223> Description of Unknown Organism : Unknown 
<400> 71 

gaattccttt tttttttttt ttttttcttt ttttttttgc ccttatacct cttcgccttt 60 
ctgtggttcc atccacttct tccccctcct cctcccataa acaactctcc tacccctgca 120 
cccccaataa ataaataaaa ggaggagggc aaggggggag gaggaggagt ggtgctgcga 180 
ggggaaggaa aagggaggca gcgcgagaag agccgggcag agtccgaacc gacagccaga 24 0 
agcccgcacg cacctcgcac catgagatgg cgacgcgccc cgcgccgctc cgggcgtccc 300 
ggcccccggg cccagcgccc cggctccgcc gcccgctcgt cgccgccgct gccgctgctg 360 
ccactactgc tgctgctggg gaccgcggcc ctggcgccgg gggcggcggc cggcaacgag 4 20 
gcggctcccg cgggggcctc ggtgtgctac tcgtccccgc ccagcgtggg atcggtgcag 480 
gagctagctc agcgcgccgc ggtggtgatc gagggaaagg tgcacccgca gcggcggcag 54 0 
cagggggcac tcgacaggaa ggcggcggcg gcggcgggcg aggcaggggc gtggggcggc 600 
gatcgcgagc cgccagccgc gggcccacgg gcgctggggc cgcccgccga ggagccgctg 660 
ctcgccgcca acgggaccgt gccctcttgg cccaccgccc cggtgcccag cgccggcgag 720 
cccggggagg aggcgcccta tctggtgaag gtgcaccagg tgtgggcggt gaaagccggg 78 0 
ggcttgaaga aggactcgct gctcaccgtg cgcctgggga cctggggcca ccccgccttc 84 0 
ccctcctgcg ggaggctcaa ggaggacagc aggtacatct tcttcatgga gcccgacgcc 900 
aacagcacca gccgcgcgcc ggccgccttc cgagcctctt tcccccctct ggagacgggc 960 
cggaacctca agaaggaggt cagccgggtg ctgtgcaagc ggtgcgcctt gcctccccaa 102 0 
ttgaaagaga tgaaaagcca ggaatcggct gcaggttcca aactagtcct tcggtgtgaa 1080 
accagttctg aatactcctc tctcagattc aagtggttca agaatgggaa tgaattgaat 114 0 
cgaaaaaaca aaccacaaaa tatcaagata caaaaaaagc cagggaagtc agaacttcgc 1200 
attaacaaag catcactggc tgattctgga gagtatatgt gcaaagtgat cagcaaatta 1260 
ggaaatgaca gtgcctctgc caatatcacc atcgtggaat caaacgctac atctacatcc 1320 
accactggga caagccatct tgtaaaatgt gcggagaagg agaaaacttt ctgtgtcaat 1380 
ggaggggagt gcttcatggt gaaagacctt tcaaacccct cgagatactt gtgcaagtgc 144 0 
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ccaaatgagt ttactggtga tcgctgccaa aactacgtaa tggccagctt ctacagtacg 1500 

tccactccct ttctgtctct gcctgaatag gagcatgctc agttggtgct gctttcttgt 1560 

tgctgcatct cccctcagat tccacctaga gctagatgtg tcttaccaga tctaatattg 1620 

actgcctctg cctgtcgcat gagaacatta acaaaagcaa ttgtattact tcctctgttc 1680 

gcgactagtt ggctctgaga tactaatagg tgtgtgaggc tccggatgtt tctggaattg 1740 

atattgaatg atgtgataca aattgatagt caatatcaag cagtgaaata tgataataaa 1800 

ggcatttcaa agtctcactt ttattgataa aataaaaatc attctactga acagtccatc 18 60 

ttctttatac aatgaccaca tcctgaaaag ggtgttgcta agctgtaacc gatatgcact 1920 

tgaaatgatg gtaagttaat tttgattcag aatgtgttat ttgtcacaaa taaacataat 1980 
aaaagg 1986 

<210> 72 
<211> 2003 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (31) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (32) 

<223> May be any nucleic acid 
<400> 72 

ggaattcctt tttttttttt tttttttctt nntttttttt tgcccttata cctcttcgcc 60 
tttctgtggt tccatccact tcttccccct cctcctccca taaacaactc tcctacccct 120 
gcacccccaa taaataaata aaaggaggag ggcaaggggg gaggaggagg agtggtgctg 180 
cgaggggaag gaaaagggag gcagcgcgag aagagccggg cagagtccga accgacagcc 240 
agaagcccgc acgcacctcg caccatgaga tggcgacgcg ccccgcgccg ctccgggcgt 300 
cccggccccc gggcccagcg ccccggctcc gccgcccgct cgtcgccgcc gctgccgctg 360 
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ctgccactac tgctgctgct ggggaccgcg gccctggcgc cgggggcggc ggccggcaac 420 
gaggcggctc ccgcgggggc ctcggtgtgc tactcgtccc cgcccagcgt gggatcggtg 4 80 
caggagctag ctcagcgcgc cgcggtggtg atcgagggaa aggtgcaccc gcagcggcgg 540 
cagcaggggg cactcgacag gaaggcggcg gcggcggcgg gcgaggcagg ggcgtggggc 600 
ggcgatcgcg agccgccagc cgcgggccca cgggcgctgg ggccgcccgc cgaggagccg 660 
ctgctcgccg ccaacgggac cgtgccctct tggcccaccg ccccggtgcc cagcgccggc 720 
gagcccgggg aggaggcgcc ctatctggtg aaggtgcacc aggtgtgggc ggtgaaagcc 7 80 
gggggcttga agaaggactc gctgctcacc gtgcgcctgg ggacctgggg ccaccccgcc 840 
ttcccctcct gcgggaggct caaggaggac agcaggtaca tcttcttcat ggagcccgac 900 
gccaacagca ccagccgcgc gccggccgcc ttccgagcct ctttcccccc tctggagacg 960 
ggccggaacc tcaagaagga ggtcagccgg gtgctgtgca agcggtgcgc cttgcctccc 1020 
caattgaaag agatgaaaag ccaggaatcg gctgcaggtt ccaaactagt ccttcggtgt 1080 
gaaaccagtt ctgaatactc ctctctcaga ttcaagtggt tcaagaatgg gaatgaattg 114 0 
aatcgaaaaa acaaaccaca aaatatcaag atacaaaaaa agccagggaa gtcagaactt 1200 
cgcattaaca aagcatcact ggctgattct ggagagtata tgtgcaaagt gatcagcaaa 1260 
ttaggaaatg acagtgcctc tgccaatatc accatcgtgg aatcaaacgc tacatctaca 1320 
tccaccactg ggacaagcca tcttgtaaaa tgtgcggaga aggagaaaac tttctgtgtg 1380 
aatggagggg agtgcttcat ggtgaaagac ctttcaaacc cctcgagata cttgtgcaag 1440 
tgcccaaatg agtttactgg tgatcgctgc caaaactacg taatggccag cttctacagt 1500 
acgtccactc cctttctgtc tctgcctgaa taggagcatg ctcagttggt gctgctttct 1560 
tgttgctgca tctcccctca gattccacct agagctagat gtgtcttacc agatctaata 1620 
ttgactgcct ctgcctgtcg catgagaaca ttaacaaaag caattgtatt acttcctctg 1680 
ttcgcgacta gttggctctg agatactaat aggtgtgtga ggctccggat gtttctggaa 174 0 
ttgatattga atgatgtgat acaaattgat agtcaatatc aagcagtgaa atatgataat 1800 
aaaggcattt. caaagtctca cttttattga taaaataaaa atcattctac tgaacagtcc 1860 
atcttcttta tacaatgacc acatcctgaa aagggtgttg ctaagctgta accgatatgc 1920 
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acttgaaatg atggtaagtt aattttgatt cagaatgtgt tatttgtcac aaataaacat 1980 
aataaaagga aaaaaaaaaa aaa 2003 



<210> 73 
<211> 957 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (809) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (810) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (811) 

<223> May be any nucleic acid 



<400> 73 
tctcgcccca 


actttttccc 


ccgcgctccg 


cagcagcagc 


agcagcagca 


gcagcagcag 


60 


caaaatggca 


gacctcttca 


gcggactcgt 


gggcggcgtc 


gtcggcgctg 


ttgctgcagc 


120 


agatttgcct 


gcggagggcg 


agagggcccc 


ccgccccgcc 


cccggcactg 


cctggacttg 


180 


ctgctgcagc 


aaactgcaag 


aaggggcccg 


cgagctggag 


ggttttgtgc 


agcagctgag 


240 


ttttgttgca 


gggaagctgg 


cctgctgcct 


gcgggtgggg 


gcggagcagc 


tggcgcgctg 


300 


cgctgcggag 


gggcggctgc 


ccagcagcag 


cagcagcagc 


agctgctgcg 


cgctgctgca 


360 


gctcgagaag 


caggacctcg 


agcagagcct 


cgaggccggc 


aagcagggcg 


cggagtgcct 


420 


cttgaggagc 


agcaaactgg 


ccctcgaggc 


cctcctcgag 


ggggcccgcg 
* 


ttgcagcaac 


480 


gcggggtttg 


ctgctggtcg 


agagcagcaa 


agacacggtg 


ctgcgcagca 


ttccccacac 


540 


ccaggagaag 


ctggcccagg 


cctacagttc 


tttcctgcgg 


ggctaccagg 


gggcagcagc 


600 


ggggaggtct 


ctgggctacg 


gggcccctgc 


tgctgcttac 


ggccagcagc 


agcagcccag 


660 
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cagctacggg gcgccccccg cctccagcca gcagccctcc ggcttcttct ggtagccctg 720 
cagcagcagc agcagcagca gcagcagcag cagcgcgggc ggcagccgcg gcggggccgg 780 
ggcgccgctg cagcaacagc agcagccgnn ncggctagcg ccgcggagca ctcgcaggga 84 0 
actccacagg cagcgggaga gcagcaggga cgagaagcag gtcatgtagc gcaggcagca 900 
gcgccagctg cagcagcagc agcagcagca gcagcagcag cagcagctcc tgcaccg 957 



<210> 74 
<211> 957 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (809) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (810) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (811) 

<223> May be any nucleic acid 

tctcgclcca actttttccc ccgcgctccg cagcagcagc agcagcagca gcagcagcag 60 
caaaatggca gacctcttca gcggactcgt gggcggcgtc gtcggcgctg ttgctgcagc 120 
agatttgcct gcggagggcg agagggcccc ccgccccgcc cccggcactg cctggacttg 180 
ctgctgcagc aaactgcaag aaggggcccg cgagctggag ggttttgtgc agcagctgag 24 0 
ttttgttgca gggaagctgg cctgctgcct gcgggtgggg gcggagcagc tggcgcgctg 300 
cgctgcggag gggcggctgc ccagcagcag cagcagcagc agctgctgcg cgctgctgca 360 
gctcgagaag caggacctcg agcagagcct cgaggccggc aagcagggcg cggagtgcct 420 
cttgaggagc agcaaactgg ccctcgaggc cctcctcgag ggggcccgcg ttgcagcaac 480 
gcggggtttg ctgctggtcg agagcagcaa agacacggtg ctgcgcagca ttccccacac 54 0 
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ccaggagaag ctggcccagg cctacagttc tttcctccgg ggctaccagg gggcagcagc 600 

ggggaggtct ctgggctacg gggcccctgc tgctgcttac ggccagcagc agcagcccag 660 

cagctacggg gcgccccccg cctccagcca gcagccctcc ggcttcttct ggtagccctg 720 

cagcagcagc agcagcagca gcagcagcag cagcgcgggc ggcagccgcg gcggggccgg 780 

ggcgccgctg cagcaacagc agcagccgnn ncggctagcg ccgcggagca ctcgcaggga 84 0 

actccacagg cagcgggaga gcagcaggga cgagaagcag gtcatgtagc gcaggcagca 900 

gcgccagctg cagcagcagc agcagcagca gcagcagcag cagcagctcc tgcaccg 957 



<210> 75 

<211> 1089 

<212> DNA 

<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (376) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (377) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (847) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (848) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (849) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (850) 

<223> May be any nucleic acid 
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<400> 75 

gaattccctc caactcttcg cgactctctc tctctcgccc caactttttc ccccgcgccc 60 
cgcagcagca gcagcagcag cagcagcaaa atggcagacc tcttcagcgg actcgtgggc 120 
ggcgtcgtcg gcgctgttgc tgcagcagat ttgcctgcgg agggcgagag ggccccccgc 180 
cccgcccccg gcactgcctg gacttgctgc tgcagcaaac tgcaagaagg ggcccgcgag 240 
ctggagggtt ttctgcagca gctgagtttt gttgcaggga agctggcctg ctgcctgcgg 300 
gtgggggcgg agcagctggc gcgctgcgct gcggaggggc ggctgcccag cagcagcagc 360 
agcagcagct gctgcnngct gctgcagctc gagaagcagg acctcgagca gagcctcgag 420 
gccggcaagc agggcgcgga gtgcctcttg aggagcagca aactggccct cgaggccctc 4 80 
ctcgaggggg cccgcgttgc agcaacgcgg ggtttgctgc tggtcgagag cagcaaagac 54 0 
acggtgctgc gcagcattcc ccacacccag gagaagctgg ctcaggccta cagttctttc 600 
ctgcggggct accagggggc agcagcgggg aggtctctgg gctacggggc ccctgctgct 660 
gcttacggcc agcagcagca gcccagcagc tacggggcgc cccccgcctc cagccagcag 720 
ccctccggct tcttctggta gccctgcagc agcagcagca gcagcagcag cagcagcagc 780 
ggcggcggca gccgcggcgg ggccggggcg ccgctgcagc aacagcagca gccgcggcgg 840 
ctagcgnnnn gagcactcgc agggaactcc acaggcagcg ggagagcagc agggacgaga 900 
agcaggtcta tgtagcgcag gcagcagcgc cagctgcagc agcagcagca gcagcagcag 960 
cagcagcagc agctcctgca ccgcagcgtt gtgtcattta ttacgttggc agctctgagg 1020 
cctcggcgca gccaacgcgc ctcaggtatc tttcagactc ttttctctaa ggtcttccag 1080 

1089 

acggaattc 



<210> 76 
<211> 1985 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 76 

cgccgagctt tcggcacctc tgccgggtgg taccgagcct tcccggcgcc ccctcctctc 60 
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ctcccaccgg cctgcccttc cccgcgggac tatcgccccc acgtttccct cagccctttt 120 
ctctcccggc cgagccgcgg cggcagcagc agcagcagca gcagcaggag gaggagcccg 180 
gtggcggcgg tggccgggga gcccatggcg tacagtcaag gaggcggcaa aaaaaaagtc 24 0 
tgctactact acgacggtga tattggaaat tattattatg gacagggtca tcccatgaag 300 
cctcatagaa tccgcatgac ccataacttg ctgttaaatt atggcttata cagaaaaatg 360 
gaaatatata ggccccataa agccactgcc gaagaaatga caaaatatca cagtgatgag 420 
tatatcaaat ttctacggtc aataagacca gataacatgt ctgagtatag taagcagatg 4 80 
catatattta atgttggaga agattgtcca gcgtttgatg gactctttga gttttgtcag 540 
ctctcaactg gcggttcagt tgctggagct gtgaagttaa accgacaaca gactgatatg 600 
gctgttaatt gggctggagg attacatcat gctaagaaat acgaagcatc aggattctgt 660 
tacgttaatg atattgtgct tgccatcctt gaattactaa agtatcatca gagagtctta 720 
tatattgata tagatattca tcatggtgat ggtgttgaag aagcttttta tacaacagat 780 
cgtgtaatga cggtatcatt ccataaatat ggggaatact ttcctggcac aggagacttg 840 
agggatattg gtgctggaaa aggcaaatac tatgctgtca attttccaat gtgtgatggt 900 
atagatgatg agtcatatgg gcagatattt aagcctatta tctcaaaggt gatggagatg 960 
tatcaaccta gtgctgtggt attacagtgt ggtgcagact cattatctgg tgatagactg 1020 
ggttgtttca atctaacagt caaaggtcat gctaaatgtg tagaagttgt aaaaactttt 1080 
aacttaccat tactgatgct tggaggaggt ggctacacaa tccgtaatgt tgctcgatgt 1140 
tggacatatg agactgcagt tgcccttgat tgtgagattc ccaatgagtt gccatataat 1200 
gattactttg agtattttgg accagacttc aaactgcata ttagtccttc aaacatgaca 1260 
aaccagaaca ctccagaata tatggaaaag ataaaacagc gtttgtttga aaatttgcgc 1320 
atgttacctc atgcacctgg tgtccagatg caagctattc cagaagatgc tgttcatgaa 1380 
gacagtggag atgaagatgg agaagatcca gacaagagaa tttctattcg agcatcagac 1440 
aagcggatag cttgtgatga agaattctca gattctga,gg atgaaggaga aggaggtcga 1500 
agaaatgtgg ctgatcataa gaaaggagca aagaaagcta gaattgaaga agataagaaa 1560 
gaaacagagg acaaaaaaac agacgttaag gaagaagata aatccaagga caacagtggt 1620 
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gaaaaaacag ataccaaagg aaccaaatca gaacagctca gcaacccctg aatttgacag 1680 

tctcaccaat ttcagaaaat cattaaaaag aaaatattga aaggaaaatg ttttcttttt 1740 

gaagacttct ggcttcattt tatactactt tggcatggac tgtatttatt ttcaaatggg 1800 

actttttcgt ttttgttttt ctgggcaagt tttattgtga gattttctaa ttatgaagca 1860 

aaatttcttt tctccaccat gctttatgtg atagtattta aaattgatgt gagttattat 1920 

gtcaaaaaaa ctgatctatt aaagaagtaa ttggcctttc tgagctgaaa aaaaaaaaaa 1980 
aaaag 1985 

<210> 77 
<211> 476 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 77 

ccaccctcct ccccctcccc cggccacttc gctaacttgg tggctgttgt gatgcgtatt 60 
cctgtagatc cgagcaccag ccggcgcttc agccccccct ccagcagcct gcagcccggc 120 
aaaatgagcg acgtgagccc ggtggtggct gcgcaacagc agcagcaaca gcagcagcag 180 
caacagcagc agcagcagca gcaacagcag cagcagcagc aggaggcggc ggcggcggct 24 0 
gcggcggcag cggcggctgc ggcggcggca gctgcagtgc cccggttgcg gccgccccac 300 
gacaaccgca ccatggtgga gatcatcgcc gaccacccgg ccgaactcgt ccgcaccgac 360 
agccccaact tcctgtgctc ggtgctgccc tcgcactggc gctgcaacaa gaccctgccc 420 
gtggccttca aggtaagagg ctaccccgcc ccccgccccc ggccgggagc ggcgga 47 6 



<210> 78 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA Primer 
<400> 78 

gcattttgga tccgcctttt catg 24 
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<210> 79 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA Primer 

<400> 79 22 
gttgtgtgct gcagattgtt cc 



<210> 80 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA Primer 
<400> 80 

gaaaaatggg gatccgaggt g 



<210> 81 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA Primer 
<400> 81 

gcaggagaat tccgtccatg 



<210> 82 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (3) 

<22 3> Can be any amino acid 



<400> 82 

Trp Ser Xaa Trp Ser 
1 5 
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<210> 83 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<400> 83 

Cys Ser Val Thr Cys Gly 
1 5 



<210> 84 
<211> 5 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> (4) 

<223> Can be any amino acid 
<400> 84 

Gly Cys Gin Xaa Arg 
1 5 



<210> 85 
<211> 733 
<212> DNA 

<213> Homo sapiens 



<400> 85 

gggatccgga gcccaaatct tctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60 

aattcgaggg tgcaccgtca gtcttcctct tccccccaaa acccaaggac accctcatga 120 

tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 180 

tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca aagccgcggg 240 

aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact 300 

ggctgaatgg caaggagtac aagtgcaagg tctccaacaa agccctccca acccccatcg 360 

agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 420 

catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct 480 

atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540 

ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag ctcaccgtgg 600 

acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660 
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acaaccacta cacgcagaag agcctctccc tgtctccggg taaatgagtg cgacggccgc 720 

733 

gactctagag gat 



<210> 86 
<211> 86 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA Primer 
<400> 86 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 60 

o c. 

cccgaaatat ctgccatctc aattag 



<210> 87 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA Primer 
<400> 87 

gcggcaagct ttttgcaaag cctaggc 27 



<210> 88 
<211> 271 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR Fragment 

<400> 88 ■ £n 

ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 60 

aaatatctgc catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc 120 

gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 180 

ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 240 

ttttggaggc ctaggctttt gcaaaaagct t 271 



<210> 89 
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<211> 32 

<212> DNA 

<213> Homo sapiens 

<400> 89 

gcgctcgagg gatgacagcg atagaacccc gg 



<210> 90 

<211> 31 

<212> DNA 

<213> Homo sapiens 

<400> 90 

gcgaagcttc gcgactcccc ggatccgcct c 



31 



<210> 91 

<211> 12 

<212> DNA 

<213> Homo sapiens 

<400> 91 
ggggactttc cc 



<210> 92 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 92 

gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60 
ccatctcaat tag 7 ^ 



<210> 93 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR Fragment 



<400> 93 

gcggcaagct ttttgcaaag cctaggc 



27 
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What J$ Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide encoding a polypeptide comprising amino acids 1 to 950 in 
SEQ ID NO:2; 

(b) a polynucleotide encoding a polypeptide comprising amino acids 2 to 950 in 
SEQ ID NO:2; 

(c) a polynucleotide encoding a polypeptide comprising amino acids 29 to 950 in 
SEQ ID NO:2; 

(d) a polynucleotide encoding a polypeptide comprising amino acids 30 to 950 in 
SEQ ID NO:2; 

(e) a polynucleotide comprising a nucleotide sequence encoding the METH1 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 209581; 

(f) a polynucleotide comprising a nucleotide sequence encoding the mature 
METHl polypeptide having the amino acid sequence encoded by the cDNA 
clone contained in ATCC Deposit No. 20958 1 ; 

(g) a polynucleotide encoding a. polypeptide comprising amino acids 1 to 890 in 
SEQ ID NO:4; 

(h) a polynucleotide encoding a polypeptide comprising amino acids 2 to 890 in 
SEQ ID NO:4; 

(i) a polynucleotide encoding a polypeptide comprising amino acids 24 to 890 in 
SEQ ID NO:4; 

<j) a polynucleotide encoding a polypeptide comprising amino acids 1 12 to 890 in 
SEQ ID NO:4; 

(k) a polynucleotide comprising a nucleotide sequence encoding the METH2 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 209582; 

(1) a polynucleotide variant created by altering a polynucleotide of (a), (b), (c), 
(d), (e), (f), (g), (h), (i), 0), or (k), wherein: 

(i) said altering includes a nucleotide insertion, deletion, or substitution, or 
any combination thereof, and 
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(ii) the number of alterations is equal to or less than 5% of the total number 
of nucleotides present in the unaltered polynucleotide; 

(m) a polynucleotide encoding amino acids 235 to 459 in SEQ ID NO:2; 

(n) a polynucleotide encoding amino acids 460 to 544 in SEQ ID NO:2; 

(o) a polynucleotide encoding amino acids 545 to 598 in SEQ ID NO:2; 

(p) a polynucleotide encoding amino acids 841 to 894 in SEQ ID NO:2; 

(q) a polynucleotide encoding amino acids 895 to 934 in SEQ DO NO:2; 

(r) a polynucleotide encoding amino acids 536 to 61 3 in SEQ ED NO:2; 

(s) a polynucleotide encoding amino acids 214 to 439 in SEQ ID NO:4; 

(t) a polynucleotide encoding amino acids 440 to 529 in SEQ ID NO:4; 

(u) a polynucleotide encoding amino acids 530 to 583 in SEQ ID NO:4; 

(v) a polynucleotide encoding amino acids 837 to 890 in SEQ ED NO:4; 

(w) a polynucleotide encoding amino acids 280 to 606 in SEQ ID NO:4; 

(x) a polynucleotide encoding amino acids 529 to 548 in SEQ ID NO:4; and 

(y) a nucleotide sequence complementary to any of the nucleotide sequences in 
(a), (b), (c), (d), (e), (f), <g), (h), (i), (j), Q0> (0> (m), ("), (o), (p), (q), (r), (s), (t), (u), (v), 
(w), or (x). 

2. Ajx isolated nucleic acid molecule, comprising a polynucleotide selected from the 
group consisting of: 

(a) 50 contiguous nucleotides of the coding region of SEQ 3D NO: 1, provided that 
said nucleotide sequence is not any one of SEQ ID Nos:14-4l, or any 
sub fragment thereof; and 

(b) . a nucleotide sequence complementary to the nucleotide sequence in (a). 
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3. An isolated nucleic acid molecule, comprising a polynucleotide selected from the 
group consisting of: 

(a) 50 contiguous nucleotides of the coding region of SEQ ID NO:3, provided that 
said nucleotide sequence is not- SEQ ED Nos: 19-22, 24, 42-77, or any 
subfragment thereof; and 

(b) a nucleotide sequence complementary to the nucleotide sequence in (a). 

4. A method for making a recombinant vector comprising inserting an isolated nucleic 
acid molecule of claim 1 into a vector in operable linkage to a promoter. 

5 . A recombinant vector produced by the method of claim 4. 

6. A method of making a recombinant host cell comprising introducing the 
recombinant vector of claim 5 into a host cell. 

7 . A recombinant host cell produced by the method of claim 6. 

8. A recombinant method for producing a METH1 or METH2 polypeptide, comprising 
culturingthe recombinant host ceil of claim 7 under conditions such that said polypeptide 
is expressed and recovering said polypeptide. 

9. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting o£ 

(a) ' amino acids I to 950 in SEQ ID NO:2; 

(b) amino acids 2 to 950 in SEQ ID NO:2; 

(c) amino acids 29 to 950 in SEQ ID NO:2; 

(d) amino acids 30 to 950 in SEQ ID NO:2; 

(e) the amino acid sequence of the METH1 polypeptide having the amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209581; 
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(f) the amino acid sequence of the mature METH1 polypeptide having the amino 
acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209581; 

(g) amino acids 1 to 890 in SEQ ID NO:4; 

(h) amino acids 2 to 890 in SEQ ID NO:4; 

(i) amino acids 24 to 890 in SEQ ID NO:4; 
(j) amino acids 1 12 to 890 in SEQ ID NO:4; 

(k) an amino acid sequence of the mature METH2 polypeptide having the amino 
acid sequence encoded by the METH2 cDNA clone contained in ATCC 
Deposit No. 209582; 

(1) the amino acid sequence of a polypeptide variant created by altering a 
. polypeptide of (a), (b), (c), (d), (e), (f), (g), (h), (i), Q), or (k), wherein: 

(i) said altering includes an amino acid insertion, deletion, or substitution, 
or any combination thereof; and 

(ii) the number of alterations is equal to or less than 5% of the total number 
of amino acids present in the unaltered amino acid sequence; 

(m) amino acids 235 to 459 in SEQ ID NO:2; 

(n) amino acids 460 to 544 in SEQ ID NO:2; 

(o) amino acids 545 to 598 in SEQ ID NO:2; 

(p) amino acids 841 to 894 in SEQ ID NO:2; 

(q) amino acids 895 to 934 in SEQ ID NO:2; 

(r) amino acids 536 to 6 1 3 in SEQ ID NO:2; 

(s) amino acids 2 14 to 439 in SEQ ID NO:4; 

(t) amino acids 440 to 529 in SEQ ID NO:4; 
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(u) amino acids 530 to 583 in SEQ ID NO:4; 
(v) amino adds 837 to 890 in SEQ ID NO:4; 
(w) amino acids 280 to 606 in SEQ ID NO:4; 
(x) amino acids 529 to 548 in SEQ ID NO:4; 
- 1 0. The isolated polypeptide of claim 9, whit* is produced in a recombinant host cell. 

11. The isolated polypeptide of claim 10, wherein said recombinant host cell is 
mammalian. 

12. An isolated nucleic acid molecule comprising a polynucleotide encoding a METH1 
or METH2 polypeptide wherein, except for one to fifty conservative amino add 
substitutions, said polypeptide has a sequence selected from the group consisting of: 

(a) amino acids from about 1 to about 950 in SEQ ID NO:2; 

(b) amino adds from about 2 to about 950 in SEQ ID NO:2; 

(c) amino acids from about 29 to about 950 in SEQ ID NO:2; 

(d) amino acids from about 30 to about 950 in SEQ ID NO:2; 

(e) the amino acid sequence of the METH1 polypeptide as encoded by the cDNA 
clone contained in ATCC Deposit No. 209581; 

(f) the amino add sequence of the mature METH1 polypeptide as encoded by the 
cDNA clone contained in ATCC Deposit No. 209581 ; 

(g) amino adds from about 1 to about 890 in SEQ ID NO:4; 

(h) amino adds from about 2 to about 890 in SEQ ID NO:4; 

(i) amino adds from about 24 to 890 in SEQ ED NO:4; 

(j) amino acids from about 1 12 to about 890 in SEQ ID NO:4; and 

(k) tibte amino add sequence of the METH2 polypeptide as encoded by the cDNA 
clone contained in ATCC Deposit No. 209582. 
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13. An isolated polypeptide wherein, except for one to fifty conservative amino acid 
substitutions, said polypeptide has a sequence selected from the group consisting of: 

(a) amino acids from about 1 to about 950 in SEQ ID NO:2; 

(b) amino acids from about 2 to about 950 in SEQ ID NO;2; 

(c) amino acids from about 29 to about 950 in SEQ ID NO:2; 

(d) amino acids from about 30 to about 950 in SEQ ID NO:2; 

(e) the amino acid sequence of the METHl polypeptide having the amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209581; 

(f) the amino acid sequence of the mature METHl polypeptide having the amino 
acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209581; 

(g) amino acids from about 1 to about 890 in SEQ ID NO:4; 

(h) amino acids from about 2 to about 890 in SEQ BO NO:4; 

(i) amino acids from about 24 to about 890 in SEQ ID NO:4; 

0") amino acids from about 1 12 to about 890 in SEQ ED NO:4; 

(k) the amino acid sequence of the METH2 polypeptide having the amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209582; 

14. An isolated nucleic acid molecule comprising a polynucleotide at least 95% identical 
to a polynucleotide selected from the group consisting of: 

(a) a polynucleotide encoding a polypeptide comprising amino acids I to 950 in 
SEQ ID NO:2; 

(b) a polynucleotide encoding a polypeptide comprising amino acids 2 to 950 in 
SEQ ID NO:2; 

(c) a polynucleotide encoding a polypeptide comprising amino acids 29 to 950 in 
SEQ ID NO:2; 
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(d) a polynucleotide encoding a polypeptide comprising amino acids 30 to 950 in 
SEQ ID NO:2; 

(e) a polynucleotide comprising a nucleotide sequence encoding the METH1 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 20958 1 ; 

(f) a polynucleotide comprising a nucleotide sequence encoding the mature 
METH1 polypeptide having the amino acid sequence encoded by the cDNA 
clone contained in ATCC Deposit No. 20958 1 ; 

(g) a polynucleotide encoding a polypeptide comprising amino acids 1 to 890 in 
SEQE)NO:4; 

(h) a polynucleotide encoding a polypeptide comprising amino acids 2 to 890 in 
SEQIDNO:4; 

(i) a polynucleotide encoding a polypeptide comprising amino acids 24 to 890 in 
SEQIDNO:4; 

(j) a polynucleotide encoding a polypeptide comprising amino acids 1 12 to 890 in 
SEQ ID NO:4; 

(k) a polynncleoti^ a nucleotide sequence encoding the METH2 

polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 209582; 
(1) a nucleotide sequence complementary to any of the nucleotide sequences in 
(a), (b), (c), (d), (e), (f), (g), <h), (i), 0) or (k), wherein 
said % identity is calculated using the FASTDB computer program, with the 
parameters: Matrix-Unitary, k-tuple=4, Mismatch Penalty*!, Joining Penalty-30, 
Randomization Group Lengths, Cutoff Score- 1, Gap Penalty=5, Gap Size 
Penalty-0.05, Window SizcF=500 or the length of the subject nucleotide sequence, 
whichever is shorter. 

15. An isolated polypeptide comprising a polypeptide having 95% identity to a 
polypeptide having an amino acid sequence selected from the group consisting of: 

(a) amino acids from about 1 to about 950 in SEQ ID NO:2; 

(b) amino acids from about 2 to about 950 in SEQ ID NO:2; 

(c) amino acids from about 29 to about 950 in SEQ ID NO:2; 

(d) amino acids from about 30 to about 950 in SEQ ID NO:2; 

(e) the amino acid sequence of the METH1 polypeptide having the amino acid 
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sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209581; 

(f) the amino acid sequence of the mature METH1 polypeptide having the amino 
acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209581; 

(g) amino acids from about 1 to about 890 in SEQ ID NO:4; 

(h) amino acids from about 2 to about 890 in SEQ ID NO:4; 

(i) amino acids from about 24 to about 890 in SEQ ID NO:4; 
(j) amino acids from about 1 12 to about 890 in SEQ ID NO:4; 

(k) the amino acid sequence of the METH2 polypeptide having the amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209582; 

and wherein said % identity is calculated using the FASTDB computer program, 
with the parameters: Matrix=PAM 0, k-tuple=2, Mismatch Penalty^!, Joining 
Penalty=-20, Randomization Group Length=0, Cutoff Score=l, Gap Penal ty=S, Gap 
Size Penalty=0.05, Window Size=500 or the length of the subject amino acid 
sequence, whichever is shorter. 

16. A method for inhibiting angiogenesis in an individual, comprising administering an 
effective amount of a polypeptide according to any one of claims 9, 13 and 15 to 
said individual. 

17. An isolated antibody of the polypeptide of any one of claims 9, 13 and 1 5. 

18. A method of identifying antagonists or agonists of the polypeptide of any one of 
claims 9, 13 and 15 comprising: 

(a) producing cells which express said polypeptide cither as a secreted protein or 
on the cell membrane; 

(b) contacting the polypeptide produced in step (a) with a test sample potentially 
containing an antagonist or agonist; and 

(c) identifying an antagonist or agonist by observing binding and stimulation or 
inhibition of activity of said protein. 

19. The nucleic acid molecule of claim 1 which is DNA or RNA. 
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20. A polynucleotide that is capable of hybridizing under stringent conditions to any one 
of the polynucleotides specified in claim 1(a) to l(y), wherein said polynucleotide 
capable of hybridizing under stringent conditions: 

(i) does not hybridize under stringent conditions to a nucleic acid molecule 
having a nucleotide sequence of only A residues or of only T residues; 

(ii) is not any one of SEQ ID NO: 1 4 to 77 or any subftagment thereof; and 

(iii) is at least 95% identical to any one of the polynucleotides specified in 
claim 1(a) to 1 (y). 

21. An in vitro method of diagnosing cancer in a subject comprising the step of 
determining the expression level of the nucleic acid molecules of claims 1 or 1 9. 

22. An in vitro method of diagnosing cancer in a subject comprising the step of 
determining the protein level of the polypeptide of any one of claims 9, 13 and 15. 

23. A pharmaceutical composition comprising the nucleic acid molecules of any one of 
claims 1 and 19, the polypeptide of any one of claims 9, 13 and 15 or the antibody of 
claim 17. 

24. A method for stimulating angiogenesis in an individual, comprising administering 
an effective amount of an antagonist of the polypeptide of any one of claims 9, 13 
and 15 to said individual. 

25. A method according to claim 24 wherein the antagonist is selected from the group 
comprising: an antibody, an oligonucleotide, a protein and a small molecule. 

26. A method for inhibiting angiogenesis in an individual, comprising administering an 
effective amount of an agonist of the polypeptide of any one of claims 9, 13 and 15 
to said individual. 

27. The antibody of claim 1 7, which is selected from the group consisting of: 

(a) a humanized antibody; 

(b) a single chain antibody; 

(c) a polyclonal antibody; 

(d) a monoclonal antibody; and 
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(e) a Fab fragment. 

28. A nucleic acid according to claim 1 substantially as herein before described with 
reference to any one of the examples. 

29. A method according to any one of claims 4, 6 or 8 substantially as herein before 
described with reference to any one of the examples. 

30. A recombinant vector according to claim 5 substantially as herein before described 
with reference to any one of the examples. 

31. A recombinant host cell according to claim 7 substantially as herein before 
described with reference to any one of the examples. 

Dated this FOURTH day of SEPTEMBER 2003. 

Tinman Genome Sciences^ Inc. and Beth Israel Deaconess Med ical Center 

Applicants 

Wray & Associates 

Perth, Western Australia 

Patent Attorneys for the Applicants 
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ATGGGGAACGCGGAGCXJGGCTCCGGGGTCTCGGAGCTTTGGGCCCGTACCCACGCTGCTGCTGCTCGCCGCGGCGCTA 
MGNAERAPGSRSFGPVPTLLLLAAAL 

CTGGCCGTGTCGGACGCACTCGGGCGCCCCTCCGAGGAGGACGAGGAGCTAGTGGTC^ 
LAVSDALGRPSEEDEELVVPELERAP 

GGACACGGGACCACGCGCCTCCGCCTGCACGCCTTTGACCAGCAGCTGGATCTGGAGCTGQXJC^ 
GHGTTRLRLHAFDQQLDLELRPDSSF 

TTGGCGCCCGGCTTCACGCT(£AGAACGTGGGGC 

LAPGFTLQNVGRKSGSETPLPETDLA 
CACTGCTTCTACTC(XX£AC(X;TGMTG€CGATCCCAGCT^ 

HCF YSGTVNGDPSSAAAL SLCEGV RG 

GCCTTCTACCTGCTGGGGGAGG(XTATTTCATCCAGCCGCTGCCrcC(XJCCAG(X5AGCGCCTCGCCACCGCCGCCCCA 
AFYLLGEAYF IQPLPAASERLATAAP 

GGGGAGMGCCGC(XX£ACCACTACAGTTCCACCTCCTC^ 

G E K P P A R_.U.Q -FHLLRRNROGDV G.G..T C.G 

GTCGTGGACGAtXJAGCCCCGGCCGACTGCGAAAGCGGAGACCGMGACGAG^^ 
VVODEPRPTGKAETEDEDEGTEGEDE 

GGGCCTCAGTGGTCGCCGCAGGACCGGGCACTGCAAGGCGTAGGACAGCCCA 

GPQWSPODPALQGVGQPTGTGS 1 RKK 

OGATTTGTGTCCAGTCACCGCTATGTGGAMCCATGCTTGTGGCAGACWGTCGATGGCAGMTTCCACGGCAGTGGT 
RFVSSHRYVETMLVADQSMAEFHGSG 

CTAAAGCATTACCTTCTCACGTTGTTTT(X£TGGCAGCC^ 
LKHYLLTLFSVAARLYKHPSI RNSVS 

CTGGTGGTGGTGAAGATCTTGGTCATCCACGATGAACAGMGGGGCCGGAAGTGACCTCCMTGCTGCCCTCACTCTG 
LVVVK 1LVIHDEQKGPEVTSNAAL TL 

CGGMCTTTTGCMCTGGCAGAAGCAGCACM^CCACXX^GTGACCGGGATGCAGAGCACTATGACACAGCAATTCTT 
RNFCNWQKQHNPPSDRDAEHYDTA I L 

TTCACCAGACAGGACTTGTGTGGGTCCCAGACATGTGATACTCTTGGGATGGCTGATGTTGGAACTGTGTGTGATCCG 
FTRQDLCGSQTCOTLGMADVGTVCDP 

AGCAGAAGCTGCTCCGTCATAGAAGATGATGGTTTACAAGCTGCCTTCACCACAGCCCATGAATTAGGCCACGTGTTT 
SRSCSV I EDOGLQAAF T TAHE LGH VF 

MCATGCCACATGATGATGCAAAGCAGTGTGCCAGCCTTAATGGTGTGAACCAGGATTCCCACATGATGGCGTCAATG 
NMPHDDAKQCASLNGVNQDSHMMASM 
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ctttccaacc7ggaccacagccagccttggtctccttgcagtgcctacatgattacatcatttctggataatggtcat 
lsnldhsqpwspcsaymi tsflongh 

ggggaatgtttgatggacaagcctcagaatcccatacagctcccaggcgatctccctggcacctcgtacgatgccaac 
geclmdkpqnp i qlpgdlpgtsydan 

cggcagtgccagtttacatttggggaggactccaaacactgccctgatgcagccagcacatgtagcaccttgtggtgt 
rocqf tfgedskhcpdaastcstlwc 

ac05gcacctctggtggggtgctggtgtgt(^(x^cacttcccgtgggoggatgg(^ccagctgt(^ 
tgtsggvlvcqtkhfpwadgtscgeg 

amtggtgtatcmcggcaagtgtgtgmcaaaaccgacagaaagcattttgata(xccttttcatggaagctgggga 
kwc 1ngkcvnktdrkhfdt. pfhgswg 

atgtgggggccttggggagactgttcgagaacgtgcx^tggaggagtccagtacacgatgagggaatgtgacaaccca 
mwgpwgdcsrtcgggvoytmrecdnp 

gtcccaaagaatggagggaagtactgtgaaggc^amcgagtgcgctacagatcctgtmccttgaggactgtccagac 
.. v .p__k_n _g gkycegkrvryrscnle 0 cp d 

mtmtggaaaaacctttagagaggmcmtgtgaagcacacmcgagttttcaaaagcttcctttgggagtgggcct 
nngktfreeqceahne fskasfgsgp 

gcggtggmtggattcccaagtacgctggq5tctcaccaaaggacaggtgcaagctcatctgccaagccaaaggcatt 
avewi pkyagvspkdrckl i cqakg i 

ggctacttcttcgttttgcagcccaaggttgtagatggtactccatgtagcccagattccacctctgtctgtgtgcaa 
gyffvlopkvvdgtpcspdstsvcvq 

ggacagtgtgtaaaagctggttgtgatcgcatcatagactccaaaaagaagtttgataaatgtggtgtttgcggggga 
gqcvkagcdr 1 i oskkkfdkcgvcgg 

mtggatctacttgtaaaaaaatatcaggatcagttactagtgcaaaacctggatatcatgatatcatcacaattcca 
ngstckk i sgsvtsakpgyhd i i t i p 

actggagccaccmcatcgmgtgamcagcggaaccagaggggatccaggaacaatggcagctttcttgccatcaaa 
tgatnievkqrnqrgsrnngsflaik 

gctgctgatggcacatatattcttaatggtgactacactttgtccaccttagagcaagacattatgtacaaaggtgtt 
aaogtyi lngdytlst leqd i mykgv 

gtcttgaggtacagcggctcctctgcggcattggaaagmttcgwgctttagccctctcaaagagcccttgaccatc 
vlrysgssaalerirsfsplkeplt i 

caojttcttactgtgggcmtgcccttcgacctaamttamtacacctactt(x;taaagmgaaga^ 
qvltvgnalrpk1kytyfvkkkkesf 
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AATGCTATCCCCACTTTTTCAGCATGGGTCATTGAAGAGTGGGGCGAATGTTCTAAGTCATGTGAATTGGGTTGGCAG 
NA I PTFSAWV1EEWGECSKSCE LGWQ 

AGAAGACTGGTAGAATGCCGAGACATTAATGGACAGCCTGCTTCCGAGTGTGCAAAGGAAGTGAAGCCAGCCAGCACC 
RRL VECRDI NGQPASECAKEVKPAST 

AGACCTTGTGCAGACCATCCCTGCCCCCAGTGGCAGCTGGGGGAGTGGTCATCATGTTCTAAGACCTGTGGGAAGGGT 
RPCADHPCPQWQLGEWSSCSKTCGKG 

TACAAAAAAAGAAGCTTGAAGTGTCTGTCCCATGATGGAGGGGTGTTATCTCATGAGAGCTGTGATCCTTTAAAGAAA 
YKKRSLKCLSHOGGVLSHESCDPLKK 

CCTAAACATTTCATAGACTTTTGCACAATGGCAGAATGCAGTTAAGTGGTTTAAGTGGTGTTAGCTTTGAGGCAAGGC 
PKHF IOFCTMAECS 

AMGTGAGGMGGGCTGGTGCAGGGAAAGCAAGAAGGCTGGAGGGATCCAGCGTATCTTGCCAGTMCCAGTGAX 
TATCAGTAA(£T(mTTAT(XXmAGATAGAAA^ 

AGGATAGTTAGTGAGGATTATTAACCTCTGAGCAGTGATATAGCATAATAAANCCCQIGGCATTATTATTATTATTTC 
TTTTGTTACATCTATTACAAGTTTAGAAAAMCAAAGCMTTGTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG 
GCGGCCGCTCTAGAGGATttiCTGGAGGGGCCCAAGCTTACGCGTGCAtGNTGTCATNAGTCTN 
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atgttccccgcccccgcojccccccggtggcttccgttcctgctgctgctgctgct(x:tgctgctgccgctggcccgc 
mfpapaaprwlpflllllllllplar 

GGCGCXXCGGCCCGGCCCGCAGCCGGGGGGCAGGCCTCGGAGCTGGTGGTGCCCACGCGGTTGCCCGGCAGCGCGGGC 
GAPARPAAGGOASELVVPTRLPG SAG 

GAGCTCGCX5CTCCACCTGTCCGCX)TTCrcCAAGGGCTTCGTGTTGCGCCTGGCGCCCGACGACAGCTTCCTGGCGCCC 
ELALHLSAFGKGFVLRLAPDDSF LAP 

GAGTTCAAGATCGAGCGCCTCGGGGGCTCCGGCCGGGCGACCGGGGGCGAGCGGGGGCTGCGCGGCTGTTTTTTTTCC 
EFK IERLGGSGRATGGERGLRGCFFS 

GGCACCGTGMTGGGGAGCCCGAGTCGCTGGCGGCGGTCAGCCTGTGCCGCGGGCTGAGCGGCTCCTTCCTGCTGGAC 
GTVNGEPESLAAVSLCRGLSGSFLLD 

GGCGAGGAGTTCACCATCCAGCCGCAGGGCGCGGGGGGCTCCCTGGCTCAGCC^ 
GEEFT IQPOGAGGSLAQPHRLQRWGP 

GCCXXJAGCCCGCCCCCTCCCGCGAGGACCCGAGTGGGAGGTGGAGACGGGAGAGGGTCAGAGGCAGGAGAGAGGAGAC 
AGARPLPRGPEWEVETGEGQRQERGD 

CACCAGGAGGACAGCGAGGAGGAGAGCCAAGAAGAGGAGGCAGAAGGCGCTAGCGAGCCGCCA 
HQEDSEEE_SQEE_E_A_E G A SEPPPPLGA 

ACGAGTAGGACCAAGCGGTTTGTGTCTGAGGCGCGCTTCGTGGAGACGCTGCTGGTGGCCGATGCGTCCATGG^ 
TSRTKRFVSEARFVETLLVAOASMAA 

TTCTACGGGGCOGACCTGCAGAACCACATCCTGACGTTMTGTCTGTGGCAGCCCGMTCTACAAGCACCCCAGCATC 
FYGADLQNHILTLMSVAARIYKHPSl 

AAGMTTCCATCMCCTGATGGTGGTAAAAGTGCTGATCGTAGAAGATGAAAAATGGGGCCCAGAGGTGTCCGACAAT 
KNS 1NLMVVKVL 1VEDEKWGP. EVSON 

GGGGGGCTTACACTGCGTAACTTCTGCMCTGGCAGCGGCGTTTCAACCAGCCCAGCGACCGCCACCCAGAGCACTAC 
GGL TLRNFCNWQRRFNOPSDRHPEHY 

GACACGGCCATCCTGCTCACCAGACAGAACTTCTGTGGGCAGGAGGGGCTGTGTGACACCCTGGGTGTGGCAGACATC 
DTA I LLTRQNFCGQEG LCDTLGVAD I 

GGGACCATTTGTGACCCCAACAAAAGCTGCTCCGTGATCGAGGATGAGGGGCTCCAGGCGGCCCACACCCTGGCCC^ 
G T I CDPNKSCSV IEDEGLQAAHTLAH 

GMCTAGGGCACGTCCTCAGCATGCCCCACGACGACTCCAAGCCCTGCACACGGCTCTTCGGGCCCATGGGCM 
ELGHVLSMPHDDSKPC TRLFGPMGKH 

CACGTGATGGCACCGCTGTTCGTCCACCTGAACCAGACGCTGCCCTGGTCCCCCTGCAGCGCCATGTATCTCACAGAG 
HVMAPLFVHLNQTLPWSPCSAMYLTE 

CTTCTGGA(£GCGGGCAC(X;AGACTGTCTCCT«;ATGCC^ 

LLOGGHGDCLLDAPGAALPLPTGLPG 

CGCATGGCCCTGTACX^AGCTGGACCAGCAGTGCAGGCAGATCTTTGGXCGGATTTCCGCCACTGC(X;CAACACCTCT 
RMALYQLOOOCROI FGPDFRHCPNTS 

FIG.2A 
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gctc/^acgtctgcgcccagcttt«;tgccacactgatggggctgagcccctgtgccacacgaagaatggcagcctg 

AQDVCAQLWCHTDGAEPLCHTKNGSL 

CCCTGGGCTGACGGCACGCCGTGCGGGCCTGGGCACCTCTGCTCAGAAGGCAGCTGTCTACCTGAGGAGGAAGTGGAG 
PWADGTPCGPGHLCSEGSCLPEEEVE 

AGGCCCAAGCCCGTGGTAGATGGAGGCTGGGCACCGTGGGGACCCTGGGGAGAATGTTCTCGGACCTGTGGAGGAGGA 
RPKPVVDGGWAPWGPWGECSRTCGGG 

GTACAGTTTTCACACQJTGAGTGCAACXJACCCCGAGCCTCAGMTGGAGGAAGATACTGCCTGGGTCGGAGAGCCAAG 
VQFSHRECKDPEPQNGGRYCLGRRAK 

TACCAGTCATGCCACACGGAGGAATGCCCCCCTGACGGGAAAAGCTTCAGGGAGCAGCAGTGTGAGAAGTATAATGCC 
YQSCHTEECPPDGKSFREQOCEKYNA 

TACV^TTACACTGACATGGAO^MTCTCCTGCAGTG^TCCCCAAGTATGCTGGGGTGTCCCCCCGGGACCGCTGC 
YNYTOMOGNL LQWVPKYAGVSPRDRC 

MGTTGTTCTGCCGAGCCCGGGGGAGGAGCX5AGTTCAMGTGTTCGAGGCCAAGGTGATTGATGGCACCCTGTGTGGG 
KLFCRARGRSEFKVFEAKV1 DGTLCG 

CCAGAMCACTCGCCATCTGTGTCCGTGGCCAGTGTGTCM^ 

PETLAICV. RGOCVKAGCDHVVDSPRK 

CTGGACAMTGCGGGGTGTGTC&Gl^ 
L DKCGVCGG.KGNSCRKVSGSL TP TNY 

GGCTACAATGACATTGTCACCATCCCAGCTGGTGCCACTAATATTGACGTGAAGCAGCGGAGCCACCCGGGTGTGCAG 
GYND I VT IPAGATN IDVKQRSHPGVQ 

AACGATGGGAACTACCTGGCGCTGAAGACGGCTGATGGGCAGTACCTGCTCAACGGCAACCTGGCCATCTCTGCCATA 
NDGNYLALKTADGQYLLNGNLA I SAl 

GAGCAGGACATCTTGGTGAAGGGGACXXTCCTGAAGTACAGCG^ 
EQDILVKGTILKYSGSIATLERLQSF 

CGGCCCTTGCCAGAGCCTCTGA(^TGCAGCTCCTGACAGTCCCTGGCGAGGTCTTCCCCCCAAAAGTCAM 
RPLPEPLTVOLLTVPGEVFPPKVKYT 

TTCTTTGTTCCTMTGAttTGGACTTTAGCATGCAGAG^C^ 

FFVPNDVDFSMQSSKERATTN 1 IQPL 

CTCCAftCACAGTGGGTGCTGG&GACTGGTC^ 
LHAOWVLGDWSECSSTCGAGWQRRTV 

GAGTGCAGGGACCCCTCCGGCCAGGCCTCTGCCACCTGCAACAAGGCTCTGAMCCCGAGGATGCCAAGCCCTGCGM 
ECROPSGQASATCNKALKPEDAKPCE 

AGCCAGCTGTGCCXXXTGTGATTCAGGGGGGCAGGGGCCAGTCTTGTGCTCCTGGACATGCGGTACTGAGGTGCAGAC 
S Q L C P L 

AAGGTCTCCACTGTGGTGACTG&TCCCTTGGCCATATC^ 

CCCTCCAGTACTGCACAAATTCCTMGGGGGMGAGAAAAGGTATGGGGCGGCAAAACCTATCATCAACTGTCCAWTG 
NMTGGAACTTGCTCG&TTCMTTAMGGCAT/^TTA^ 
TTGCANGATACAACTANTAAAAAAAAAAAAAAAAAAAAAAAAAA - 
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FIG.7B 
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